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PEPTIDE EPITOPES RECOGNIZED BY ANTIGEN SPECIFIC CD4'' T 



LYMPHOCYTES 



Field of the Invention 



5 



The invention relates to the identification of 



naturally processed and presented HLA class II restricted 
peptides and their use as therapeutics and prophylactics. 



intact protein antigens by antigen presenting cells 
(APCs) , class II Major Histocompatibility Complex (MHC) 
molecules on the APCs bind short antigenic peptides 
(epitopes) derived from the antigens, presenting the bound 
15 peptides to 004"*" T lymphocytes [Germain, R.N. (1994), Cell 
76:287-299]. Class II MHC genes and the molecules they 
encode are highly variable among individuals, and 
differences among the 

class II MHC molecules determine which peptides are 
20 selected for presentation as T cell epitopes. 

Identification of the naturally processed peptide 
fragments of a polypeptide of interest that are presented 
by class II MHC molecules of a subject can be useful for 
developing peptides that regulate immune response to the 
25 polypeptide in that subject. 

Carcinoembryonic antigen (CEA) was first described by 
Gold and Freedman (1965), J. Exp. Med. 121:439-462. CEA 
is a highly glycosylated, 180, 000-dalton protein that is 
expressed on most gastrointestinal carcinomas, including a 
30 large number of primary and metastatic colorectal tumors. 
It is also found on some normal, endodermally derived 
tissues, though in much lower concentrations. CEA cell 
surface expression can be detected using the monoclonal 



10 



Background of the Invention 
After internalization and proteolytic processing of 
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antibody (mAb) Col-1, and CEA can be identified in total 
cellular protein by western blot analysis using the same 
mAb. An MHC class I {HLA-A2) -restricted cytotoxic 
T lymphocyte (CTL) response against a processed CEIA 
5 epitope has been described in patients with metastatic 
disease [Tsang et al. (1995), J. Natl. Cancer. Inst. 
87:982-990]. This MHC class I-restricted epitope was 
identified by taking peripheral blood lymphocytes from 
patients after immunization with a recombinant vaccinia 

10 virus encoding CEA and screening the T cell response 

against synthetic 9- to 11-mer CEA peptides. CTLs from 
three patients were shown to have specificity for a class 
I-re'stricted CEA epitope in an assay using autologous 
Epstein Barr Virus (EBV) -transformed B cells pulsed with a 

15. 9 amino acid synthetic peptide derived from CEA. 

Generally, HLA class Il-restricted epitopes are 
different from class I-restricted epitopes, and produce 
different immunological responses. Epitopes presented by 
HLA class II molecules are generally, although not 

20 exclusively, recognized by CD4'^ T cells. The HLA class II 
restricted CD4+ T cell response can act as a helper 
response, providing help to HLA class I-restricted CTL. 
Class Il-restricted responses may also result in the 
recruitment of cells such as tumoricidal macrophages and 

25 eosinophils that work in unison to eliminate neoplastic 
cells. Furthermore, recent evidence suggests that T 
helper cells can elicit an anti-ahgiogenic response in the 
microenvironment of a tumor mass, thereby cutting off the 
blood supply to the rapidly dividing cells. 

30 The potential role of CD4'*" T cells in antitumor 

immunity and cancer immunotherapy has been the subject of 
general review [Topalian (1994), Current Opinion in 
Immunology 6:741-745; Pardoll and Topalian (1998), Current 
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Opinion in Immunology 10:588-594; Toe et al. (1999), J. 
Exp- Med. 189:753-756]. Among other functions, 004"*" T 
cells provide help and immunological memory, can play an 
important role in initiating and inaintaining specific . 
5 anti-tumor immunity, and may be instrumental in the 
phenomenon of tumor-specific T cell tolerance. 

SUMMZVRY OF THE INVENTION 
The invention features methods for identifying 

10 peptide epitopes that activate 004* T lymphocyte responses 
involved in the initiation, promotion, or exacerbation of 
certain diseases. 

The invention is based on the discovery that one can 
identify HLA class Il-restricted epitopes naturally 

15 produced by APC transfected with DNA encoding a protein 
from which the epitopes are derived. These peptide 
epitopes can be incorporated into a drug delivery system 
and used to elicit an epitope-specif ic 004"*" T cell response 
in an appropriate mammalian subject. Where the epitope is 

20 derived from a tumor antigen, such a response promotes the 
migration of CDS"^ CTL to the vicinity of tumor cells 
expressing the tumor antigen and specific targeting of the 
tumor cells by the CTL. This CD4^ T lymphocyte response 
could also promote migration of tumoricidal macrophages 

25 and eosinophils to the vicinity of the tumor cells to help 
kill the tumor cells. Finally, the CD4*** T cell response 
can also produce an anti-angiogenic effect at the 
microenvironment of the tumor, cutting off the nutrient 
supply needed to continue cell division and tumor growth. 

30 Altered peptide ligands (APL) , which are class II 

MHC-binding variant peptides in which 1 to 6 amino acid 
residues are different from the correspondirig residues of 
the wild-type class II MHC-binding peptide, but which 
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Still bind to the same class II MHC molecules as the wild- 
type peptides, are also encompassed by the invention, as 
are methods of therapy and prophylaxis involving the use 
of APL. APL have the ability to elicit different patterns 
5 of cytokine production in 004**" T cells than do their parent 
wild- type peptides. Thus, for example, while a wild-type 
peptide presented by a class II MHC molecule may induce 
production of Thl cytokines, an APL derived from it and 
presented by the same class II MHC molecule may elicit Th2 

10 or other immunoregulatory cytokines. Alternatively, the 
wild-type peptide may stimulate the production of Th2 
cytokines while a corresponding APL elicits production of 
Thl cytokines. 

More specifically, the invention features a method of 

15 identifying a class II MHC-binding fragment of a 
polypeptide- The method involves: (a) providing a 
mammalian antigen-presenting cell (APC) cell line 
transfected with a DNA encoding a polypeptide; (b) 
isolating from the APC a class II MHC molecule bound to a 

20 peptide, the peptide being a class II MHC-binding fragment 
of the polypeptide; (c) eluting the peptide from the class 
II MHC molecule; and (d)' identifying the peptide as a 
fragment of the polypeptide. The polypeptide can have the 
sequence of a tiomor antigen (e.g., CEA) . The APC can be 

25 any class II MHC-expressing mammalian cell, e.g., a 
dendritic cell, a macrophage, a monocyte or a B 
lymphocyte, and the mammal from which it is derived can be 
a human. The class II MHC molecule can be a HLA-DR 
molecule, or a HLA-DQ molecule, or a HLA-DP molecule. 

30 Such a DR molecule can have a p-chain encoded by a 

DRB1*0401, DRB1*0101, DRB1*0301, DRB1*0701, DRB1*1101, 
DRB1*1301, DRB1*1302, DRB1*1501, or DRB5*0101 gene. 
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The invention provides an isolated peptide fewer than 
35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, 
or 14) amino acid residues in length and including the 
sequence KEVLLLVHNLPQH (SEQ ID N0:1). The peptide thus 
5 can include or be the amino acid sequence KEVLLLVHNLPQHL 
(SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID N0:3); 
KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL (SEQ ID 
NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6); or 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13). Also embraced by the 

10 invention is a peptide fewer than 35 (e.g., fewer than 33, 
30, 26, 22, 19, 18, 17, 16, 15, or 14) amino acid residues 
in length that includes or is the sequence YLWWVNGQSLPVSPR 
(SEQ ID NO: 7), optionally with additional CEA sequence on 
one or both ends. The invention also includes a peptide 

15 fewer than 35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 

17, 16, 15, or 14) amino acid residues in length and 
including the sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 
The peptide thus can include or be the amino acid sequence 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14) or NPPAQYSWLIDGNIQQHT 

20 (SEQ ID NO: 15) . 

Another embodiment of the invention is an altered 
peptide ligand (APL) the amino acid sequence of which is 
identical, except for 1-6 amino acid substitutions, to a 
fragment of carcinoembryonic antigen (CEA) , the fragment 

25 being fewer than 35 (e.g., fewer than 33, 30, 26, 22, 19, 

18, 17, 16, 15, or 14) amino acids residues in length and 
including or being the sequence KEVLLLVHNLPQH (SEQ ID 
N0:1), YLWWVNGQSLPVSPR (SEQ IDN0:7), or 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14), optionally with 

30 additional CEA sequence on one or both ends. No more than 
30% of the amino acid residues of the fragment are 
substituted with different amino acid residues in the APL, 
and the APL can bind to a class II MHC molecule. The 
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sequence of the fragment can be, for example, 
KEVLLLVHNLPQH (SEQ ID N0:1), YLWWVNGQSLPVSPR (SEQ ID 
N0:7), or NPPAQYSWLIDGNIQQH (SEQ ID N0:14), optionally 
with additional CEA sequence on one or both ends. In this 
5 method, the mammal can be one suspected of having or being 
susceptible to cancer. 

The invention also features a process for making an 
APL. The method involves: (a) carrying out the above 
method of identifying a class II MHC-binding fragment of a 

10 polypeptide, and (b) synthesizing an APL consisting of a 
sequence which is identical to that of the peptide, except 
having amino acid substitutions atl, 2, 3, 4, 5, or 6 
positions in the peptide. The polypeptide can be a tiomor 
antigen such as CEA. 

15 Another aspect of the invention is a method of 

activating T cell reactivity in a mammal. The method 
involves: (a) providing (i) a peptide, the sequence of 
which consists of the sequence of a naturally processed 
fragment of CEA, the peptide being capable of binding to a 

20 class II MHC molecule of the mammal and of eliciting a CD4'*^ 
T cell response, or (ii) a DNA encoding a polypeptide that 
is (1) the peptide, (2) the peptide plus an amino terminal 
methionine residue, and (3) either (1) or (2) linked to a 
trafficking sequence; and (b) administering the peptide or 

25 DNA to the mammal. The peptide can include or be 

KEVLLLVHNLPQH (SEQ ID N0:1), KEVLLLVHNLPQHL (SEQ ID NO: 2), 
KEVLLLVHNLPQHLF (SEQ ID NO: 3), KEVLLLVHNLPQHL FG (SEQ ID 
NO: 4), GKEVLLLVHNLPQHL (SEQ ID NO: 5), EGKEVLLLVHNLPQHLFG 
(SEQ ID N0:6), YLWWVNGQSLPVSPR (SEQ ID N0:7), 

30 EGKEVLLLVHNLPQHL (SEQ ID NO: 13), NPPAQYSWLIDGNIQQH (SEQ ID 
N0:14), or NPPAQYSWLIDGNIQQHT (SEQ ID N0:15), optionally 
with additional CEA sequence on one or both ends. 
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The invention also provides a method of altering a T 
cell response in a mammal. The method involves: (a) 
providing (i) an APL having a sequence identical, except 
for amino acid sxibstitutions at 1-6 positions, to the 
5 sequence of a naturally-processed fragment of CEA, the APL 
being able to bind to a class II MHC molecule of the 
mammal, or (ii) a DNA encoding a polypeptide that can be 
(1) the APL, (2) the APL plus an amino terminal methionine 
residue, and (3) either (1) or (2) linked to a trafficking 
10 sequence; and (b) administering the APL or DNA to the 
mammal . 

The above described method of identifying a class II 
MHC-binding fragment of a polypeptide can include the 
additional steps of: (e) providing CD4'*' lymphocytes from a 

15 mammal having a condition suspected of being associated 
with presentation of the peptide by the class II MHC 
molecule, the APCs of the mammal bearing the class II MHC 
molecule; (f) providing a population of APCs that bear the 
class II MHC molecule with the peptide bound thereto; (g) 

20 contacting the population of APCs of (f) with the CD4'*' 
lymphocytes of (e) ; and (h) determining whether the CD4'*" 
lymphocytes recognize the class II MHC-bound peptide, as 
an indication that presentation of the peptide to CD4'*" T 
lymphocytes is associated with the condition. The 

25 presentation can be associated with a pathological 

response of CD4'*' T lymphocytes or a protective response of 
CD4'*' T lymphocytes. 

Another aspect of the invention is a method of 
diagnosis that involves: (a) providing a CD4'*" lymphocyte 

30 from an individual suspected of having or being 

susceptible to cancer; (b) providing an APC which bears on 
its surface a class II MHC molecule of an allele identical 
to one expressed by the individual, the class II MHC 
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molecule being bound to a CEA peptide; (c) contacting the 
APC with the CD4*** lymphocyte; and (d) determining whether 
the 004"*" lymphocyte recognizes the class II MHC-bound 
peptide, as an indication that the individual has or is 

5 susceptible to cancer. The peptide can include or be the 
amino acid sequence KEVLLLVHNLPQH {SEQ ID N0:1); 
ECEVLLLVHNLPQHL (SEQ ID NO: 2;) KEVLLLVHNLPQHLF (SEQ ID 
NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL 
(SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6); 

10 YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGKEVLLLVHNLPQHL (SEQ ID 
N0:13); NPPAQYSWLIDGNIQQH (SEQ IDN0:14); or 
NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15), optionally with 
additional CEA sequence on one or both ends- 

The invention also features a method of treating a 

15 subject suspected of having or being susceptible to 

cancer- The method involves administering a peptide to 
the subject- The peptide can be fewer than 35 (e.g-, 
fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, or 14) 
amino acid residues in length and include a sequence 

20 KEVLLLVHNLPQH (SEQ ID N0:1), optionally with additional 
CEA sequence on one or both ends. Alternatively, the 
peptide can be fewer than 35 (e.g., fewer than 33, 30, 26, 
22, 19, 18, 17, 16, 15, or 14) amino acid residues in 
length and include or be the sequence YLWWVNGQSLPVSPR (SEQ 

25 ID N0:7), optionally with additional CEA sequence on one 
or both ends- Furthermore, the peptide can be fewer than 
35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, 
or 14) amino acid residues in length and include or be the 
sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14), optionally with 

30 additional CEA sequence on one or both ends. 

The invention also provides a method of identifying a 
reagent for diagnosing cancer. The method involves: 
(a) providing a test reagent that can be a Fab fragment, a 
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monoclonal antibody (mAb) , or a single chain Fv (scFv) 
fragment; (b) providing a complex that contains a class II 
MHC molecule bound to a peptide that includes or is 
KEVLLLVHNLPQH{SEQ ID N0:1); KEVLLLVHNLPQHL (SEQ ID N0:2); 
5 KEVLLLVHNLPQHLF (SEQ ID NO: 3); KEVLLLVHNLPQHLFG (SEQ ID 
NO: 4); GKEVLLLVHNLPQHL (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG 
(SEQ ID N0:6); YLPffWVNGQSLPVSPR (SEQ ID NO:7); 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13); NPPAQYSWLIDGNIQQH (SEQ ID 
NO; 14); or NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15); and 

10 (c) testing whether the test reagent binds to the complex. 
The class II MHC molecule can be a DR molecule with a p- 
chain encoded by a DRB1*0401 gene or by a DRB1*0101 gene. 
As used herein, a ^^scFv" fragment is a recombinant 
fragment of an antibody molecule that contains, in a 

15 single polypeptide chain, the antigen-binding regions of 
an immunoglobulin (Ig) heavy and an Ig light chain. scFv 
fragments generally either contain (a) no Ig heavy or Ig 
light chain constant regions or (b) less than the whole 
constant region of an Ig heavy and/or Ig light chain. 

20 Also embraced by the invention is a method of 

diagnosis. The method involves: (a) providing a test cell 
from a mammalian subject; (b) providing a reagent that 
binds to a CEA peptide fragment bound to a class II MHC 
molecule; (c) contacting the test cell with the reagent; 

25 and (d) detecting binding of the reagent to the test cell 
as an indication that the test cell is a cancer cell- 

The invention also features a method of cancer 
therapy that involves: (a) providing a composition that 
includes a reagent that can be a Fab fragment, a mAb, or a 

30 scFv fragment, the reagent being able to recognize a 

naturally processed CEA peptide bound to a MHC class II 
moleciale and being linked to an agent that can be a 
chemotherapeutic compound, a radioactive isotope or a 
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toxin; and (b) administering the composition to a subject 
suspected of having or being susceptible to a cancer 
characterized by expression of CEA. 

Another aspect of the invention is a method of 
5 identifying a class II MHC-binding fragment of a tumor 
antigen. The method involves: (a) providing a mammalian 
APC that contains a class II MHC molecule and the tumor 
antigen; (b) isolating from the APC the class II MHC 
molecule bound to a peptide, the peptide being a class II 

10 MHC binding fragment of the tvimor antigen; (c) eluting the 
peptide from the class II MHC molecule; and (d) 
identifying the amino acid sequence of the peptide. The 
method can involve the additional steps of: (e) providing 
CD4'^ lymphocytes from a mammal having a cancer suspected of 

15 being associated with presentation of the peptide by the 
class II MHC molecule, the APCs of the mammal bearing the 
class II MHC molecule; (f) providing a population of APCs 
that bear the class II MHC molecule with the peptide bound 
thereto; (g) contacting the population of APCs of (f) with 

20 the CD4'*' lymphocytes of (e) ; and (h) determining whether 
the CD4'*' lymphocytes recognize the class II MHC bound 
peptide, as an indication that presentation of the peptide 
to CD4^ lymphocytes is associated with the cancer. 

The invention also provides an isolated DNA that 

25 contains a nucleotide sequence that encodes a peptide 
fewer than 35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 
17, 16, 15, or 14) amino acid residues in length, and that 
contains the sequence KEVLLLVHNLPQH (SEQ ID N0:1). The 
peptide encoded by the DNA thus can include or be, for 

30 example, the amino acid sequence KEVLLLVHNLPQHL (SEQ ID 
NO: 2); KEVLLLVHNLPQHLF (SEQ ID NO: 3); KEVLLLVHNLPQHL FG 
(SEQ ID NO: 4); GKEVLLLVHNLPQHL (SEQ ID NO: 5); 
EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6); or EGKEVLLLVHNLPQHL (SEQ 
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ID NO: 13), optionally with additional CEA (or non-CESA) 
sequence on one or both ends. Also embraced by the 
invention is DNA encoding a peptide fewer than 35 (e.g., 
fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, or 14) 
5 amino acid residues in length that includes or is the 
sequence YLWWVNGQSLPVSPR (SEQ ID NO: 7), optionally with 
additional CEA sequence on one or both ends. The 
invention also features an isolated DNA that contains a 
nucleotide sequence that encodes a peptide fewer than 35 

10 (e.g., fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, or 
14) amino acid residues in length, and that contains the 
sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). The peptide 
encoded by the DNA thus can include or be, for example, 
the amino acid sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14), 

15 or NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15), optionally with 
additional CEA sequence on one or both ends. Also 
featured is a vector containing the above DNA of the 
Invention. In the vector the nucleotide sequence can be 
operatively linked to a transcriptional regulatory 

20 element. Also included in the invention is a cell (e.g., 
a mammalian, an insect, a bacterial, a yeast, or a fungal 
cell) containing any of the vectors of the invention. 

Another aspect of the invention is an isolated 
peptide containing a fragment of CEA that includes: 

25 (a) KEVLLLVHNLPQH (SEQ ID N0:1) and a further 1-15 

residues of CEA sequence at the C-terminus of SEQ ID N0:1, 
at the N-terrainus of SEQ ID N0:1, or at both the N- 
terminus and the C-terminus of SEQ ID N0:1; 
(b) YLWWVNGQSLPVSPR (SEQ ID NO: 7) and a further 1--15 

30 residues of CEA sequence at the C-terminus of SEQ ID NO: 7, 
at the N-terminus of SEQ ID NO: 7, or at both the N- 
terminus and the C-terminus of SEQ ID NO: 7; or (c) 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14) and a further 1-15 
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residues of CEA sequence at the C- terminus of SEQ ID 
NO: 14, at the N- terminus of SEQ ID NO: 14, or at both the 
N-terminus and the C-terminus of SEQ ID NO: 14. This 
peptide can be used in a method of activating T cell 
5 reactivity in a mammal. The method involves: (a) 

providing (i) any of the peptide or (ii) a DNA encoding a 
polypeptide which can be (1) the peptide, (2) the peptide 
plus an amino terminal methionine residue, or (3) either 
(1) or (2) linked to a trafficking sequence; and (b) 

10 administering the peptide or DNA to the mammal. 

Another embodiment of the invention is an isolated 
fragment of CEA shorter than full-length CEA; the fragment 
includes one or more amino acid sequences selected from 
the group consisting of SEQ ID N0:1, SEQ ID NO: 7, and SEQ 

15 ID NO: 14. This fragment of CEA can be used in a method of 
activating T cell responsiveness in a mammal. The method 
involves: (a) providing (i) the CEA fragment, or (ii) a 
DNA encoding a polypeptide that can be (1) the CEA 
fragment, (2) the CEA fragment plus an amino terminal 

20 methionine residue, and (3) either (1) or (2) linked to a 
trafficking sequence, the DNA not encoding full-length 
CEA; and 

(b) administering the fragment or DNA to the mammal. 

Another method of activating T cell responsiveness in 

25 a mairanal involves: (a) administering to the mammal 

(i) CEA or (ii) a DNA encoding CEA; and (b) testing CD4+ T 
cells of the animal for responsiveness to a naturally 
processed fragment of CEA (e.g., fragments with SEQ ID 
N0S:l-7 and 13-15) . 

30 The invention also provides an isolated peptide that 

includes: (a) at least one CEA segment that can be a 
segment with SEQ ID N0:1, SEQ ID NO: 7, and SEQ ID NO: 14; 
and (b) one or more cunino acids at the C-terrainus of the 
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CEA segment, at the N-terminus of the CEA segment, or at 
both the C- terminus and the N-terminus of the CEA segment, 
the peptide having an amino acid sequence that is not 
identical to that of full-length CEA. This peptide can be 

5 used in a method of activating T cell reactivity in a 
mammal. The method involves: (a) providing (i) the 
peptide, or (ii) a DNA encoding a polypeptide that can be 
(1) the peptide, (2) the peptide plus an amino terminal 
methionine residue, and (3) either (1) or (2) linked to a 

10 trafficking sequence, the DNA not encoding full-length 
CEA; and (b) administering the peptide or DNA to the 
mammal . 

An "isolated" peptide of the invention is a peptide 
that either has no naturally-occurring counterpart (e.g., 

15 an APL) , or has been separated or purified from components 
which naturally accompany it,- e.g., in tissues such as 
pancreas, liver, spleen, ovary, testis, muscle, joint 
tissue, neural tissue, gastrointestinal tissue, or body 
fluids such as blood, serum, or urine- Such a naturally- 

20 occurring peptide is considered "isolated" when it is at 
least 70%, by dry weight, free from the proteins and 
peptides with which it is naturally associated. 
Preferably, a preparation of a peptide of the invention is 
at least 80%, more preferably at least 90%, and most 

25 preferably at least 99%, by dry weight, the peptide of the 
invention. Since a peptide that is chemically synthesized 
is, by its nature, separated from the components that 
naturally accompany it in the tissue where it occurs 
naturally, the synthetic peptide is, by definition, 

30 "isolated." 

An isolated peptide of the invention can be obtained, 
for example, by extraction from a natural source (e.g., 
from human tissues or bodily fluids) ; by expression of a 
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recombinant nucleic acid encoding the peptide; or by 
chemical synthesis. A peptide that is produced in a 
cellular system different from the source from which it 
naturally originates is "isolated," because it will be 
5 separated from components which naturally accompany it. 
The extent of isolation or purity can be measured by any 
appropriate method, e.g., column chromatography, 
polyacrylamide gel electrophoresis, mass analysis, or HPLC 
analysis. 

10 An "isolated DNA" means DNA free of the genes that 

flank the gene of interest (e.g., the gene encoding CEA) 
in the genome of the organism in which the gene of 
interest naturally occurs. The term therefore includes a 
recombinant DNA incorporated into a vector, into an 

15 autonomously replicating plasmid or virus, or into the 
genomic DNA of a prokaryote or eukaryote. It also 
includes a separate molecule such as: a cDNA where the 
corresponding genomic DNA has introns and therefore a 
different sequence; a genomic fragment; a fragment 

20 produced by polymerase chain reaction (PGR) ; a restriction 
fragment; a DNA encoding a non-naturally occurring 
protein, fusion protein, or fragment of a given protein; 
or a nucleic acid which is a degenerate variant of a 
naturally occurring nucleic acid. In addition, it 

25 includes a recombinant nucleotide sequence that is part of 
a hybrid gene, i.e., a gene encoding a fusion protein. 
Also included is a recombinant DNA that includes a DNA 
sequence that encodes any of the peptides with SEQ ID 
N0S:l-7 and 13-15. 

30 As used herein, "protection from a mammalian disease" 

means prevention of onset of a mammalian disease or 
lessening the severity of a disease existing in a mammal. 
"Prevention" can include a delay of onset, as well as a 
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partial or complete block in progress of the disease or 
disease symptoms - 

As used herein, "a naturally-processed peptide 
fragment" is a peptide fragment produced by proteolytic 

5 degradation of a protein in an antigen presenting cell of 
a mammal- As used herein, a "tumor antigen" is a molecule 
(e.g., a protein molecule) that is expressed by a tumor 
cell. Such a molecule can differ (e.g., by one or more 
amino acid residues where the molecule is a protein) from, 

10 or it can be identical to, its counterpart expressed in 
normal cells. It is preferably not expressed by normal 
cells. Alternatively, it is expressed at a higher level 
(e.g., a two-fold, three-fold, five-fold, ten-fold, 20- 
fold, 40-fold, 100-fold, 500-fold, 1,000-fold, 5,000-fold, 

15 or 15,000-fold higher level) in a tumor cell than in the 
tumor cell's normal counterpart. Examples of tumor 
antigens include, without limitation, CEA, prostate 
specific antigen (PSA), MAGE (melanoma antigen) 1-4, 6 and 
12, MUC (mucin) (e.g., MUC-1, MUC-2, etc.), tyrosinase, 

20 MART (melanoma antigen) , Pmel 17 (gplOO) , GnT-V intron V 
sequence (N-acetylglucoaminyltransferase V intron V 
sequence). Prostate Ca psm, PRAME (melanoma antigen), p- 
catenin, MUM-l-B (melanoma ubiquitous mutated gene 
product), GAGE (melanoma antigen) 1, BAGE (melanoma 

25 antigen) 2-10, C-ERB2 (Her2/neu) , EBNA (Epstein-Barr Virus 
nuclear antigen) 1-6, gp75, human papilloma virus (HPV) E6 
and E7, p53, lung resistance protein (LRP) , Bcl-2, and Ki- 
67. Recognition of such a peptide by 004**" T cells of a 
mammal (e.g., a human patient) is indicative of the 

30 existence, or future onset, of cancer in the mammal. 
Unless otherwise defined, all technical and 
scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art to 
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which this invention pertains. In case of conflict, the 
present document, including definitions, will control. 
Preferred methods and materials are described below, 
although methods and materials similar or equivalent to 
5 those, described herein can be used in the practice or 
testing of the present invention- Unless otherwise 
indicated, these materials and methods are illustrative 
only and are not intended to be limiting. All 
publications, patent applications, patents and other 
10 references mentioned herein are incorporated by reference. 

other features and advantages of the invention, e.g., 
methods of identifying peptides that activate CD4* T 
lymphocyte responses, will be apparent from the following 
description, from the drawings and from the claims. 

15 

Brief Description of the Drawings 
Fig. lA is a depiction of the nucleotide sequence of 
CEA-encoding cDNA (SEQ ID NO: 12) that was inserted into 
the expression vector pcDNA3.1-Zeo. 
20 Fig. IB is a depiction of the amino acid sequence of 

CEA (SEQ ID NO: 17) encoded by the cDNA that was inserted 
into the expression vector pcDNA3. 1-Zeo. 

Fig. 2 is a histogram showing the fluorescence- 
activated flow cytometric profiles of JY cells that were 
25 stained with CEA specific murine mAb and fluorescein 

isothiocyanate (FITC) -conjugated goat antibody specific 
for murine immunoglobulin. 

Fig. 3 is a diagram showing two appropriately aligned 
matrix assisted laser desorption time-of-f light (MALDI- 
30 TOF) mas spectometry spectra derived from two mixtures of 
peptides in which separate aliquots of peptide mixtures 
prepared from A2- 140-pcDNA3 and A2 , 140-CEA-38 cells were 
analyzed by IMF procedures. 
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Fig. 4 is a MS/MS Spectrum of naturally processed 
KEVLLLVHNLPQH (SEQ ID NO: 1) . The two background 
responses were derived from a peptide that co-eluted 
chromatographically with KEVLLLVHNLPQH (SEQ ID NO: 1) and 
5 was closely related in M/Z. These peptides (KEVLLLVHNLPQH 
(SEQ ID NO: 1) and the co-eluting species) could be only 
partially resolved chromatographically, but this did not 
prevent the identification of KEVLLLVHNLPQH (SEQ ID NO: 1) 
from the resultant composite mass spectrum. Spectral 
10 labeling in Figs. 3-9 use the nomenclature described by 
Roepstorff and Fohlman (1988), Biomed. Mass. Spectrom. 
11:601, and Biemann (1990), Meth. Enzymol. 193:866-867. 

Fig. 5 is a MS /MS spectrum of naturally processed 
KEVLLLVHNLPQHL (SEQ ID NO: 2) 
15 Fig. 6 is a MS/MS spectrxmi of naturally processed 

KEVLLLVHNLPQHLF (SEQ ID NO: 3). 

Fig. 7 is a MS/MS spectrum of naturally processed 
KEVLLLVHNLPQHLFG (SEQ ID NO: 4). 

Fig. 8 is a MS/MS spectrum of naturally processed 
20 GKEVLLLVHNLPQHL (SEQ ID NO: 5). 

Fig. 9 is a MS/MS spectrum of naturally processed 
EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6). 

Fig. 10 is a MS /MS spectrum of naturally processed 
YLWWVNGQSLPVSPR (SEQ ID N0:7) . 
25 Fig. 11 is a MS /MS spectrum of naturally processed 

EGKEVLLLVHNLPQHL (SEQ ID NO: 13). 

Fig. 12 is a MS/MS spectrum of naturally processed 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 

Fig. 13 is a MS /MS spectrum of naturally processed 
30 NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 
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DETAILED DESCRIPTION 
The present invention is based on the novel discovery 
that, by transfecting an APC with an expression vector 
5 containing a DNA sequence encoding a polypeptide of 

interest (PPI)/ the APC will synthesize the PPI and then 
transport it to one of the antigen-processing organelles 
within the cell, e.g., endosomes, lysozomes and structures 
designated "MHC class II compartments" (MIIC) that have 

10 lysosomal characteristics and are enriched for class II 
MHC molecules but are substantially devoid of class I MHC 
molecules [Peter et al. (1991), Nature 349:669-676]. The 
PPI is degraded by proteolytic enzymes into peptide 
fragments. If any of these peptide fragments has, by 

15 virtue of its length and sequence, the ability to bind to 
one of the class II MHC molecules expressed by the APC, it 
will do so in the antigen processing organelle. The 
resulting peptide-class II MHC molecular complex is then 
transported to the APC's cell membrane, where it becomes 

20 available for interaction with CD4'*' T cells bearing 

antigen-specific receptors that specifically recognize 
that particular peptide-class II MHC complex. By eluting 
peptides from class II MHC molecules isolated from these 
APC, a set of naturally processed peptides derived from 

25 the PPI, as well as from other polypeptides of 

intracellular or extracellular origin, is obtained. The 
peptides, which are specific to the particular types 
(isotypes and alleles) of class II MHC molecules expressed 
by the APC, are then chemically separated and their amino 

30 acid sequences determined. By comparison of the peptide 
amino acid sequences to the sequence of the PPI, it is 
possible to identify those which are derived from the PPI. 
Thus, the present invention includes a method of 
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identifying peptide fragments that are naturally processed 
by APC and have intrinsic binding affinity for the 
relevant class II MHC molecule. The method can be 
invaluable for identifying peptides derived from a 
5 polypeptide suspected of being an antigen that activates 
CD4'*' T cells involved in either (a) the pathogenesis 
(pathology) of a disease, especially one in which 
susceptibility or protection is known to be associated 
with expression of a particular type of class II MHC 

10 molecule, or (b) prevention or reduction of the symptoms 
of a disease, especially one in which protection or a 
reduction in severity is associated with expression of a 
particular type of class II MHC molecule. The method is 
designated "Immunological Mass Fingerprinting" ("IMF"). 

15 The described method ensures that the peptides 

identified are those that both (i) are naturally processed 
in vivo by the APC, and (ii) become associated, in the 
APC, with the relevant class II MHC molecules. 

Furthermore, the present method controls for class II 

20 MHC type, an important aspect essential to link any given 
peptide to a particular CD4'*' T cell-mediated disease in a 
given individual but especially important in disorders in 
which class II MHC type determines disease susceptibility 
or resistance. 

25 Any naturally processed peptide with a sequence that 

corresponds to a fragment of the PPI, and which binds to a 
class II MHC molecule associated with the disease of 
interest, could be a peptide that activates CD4'*" T cells 
that either initiate, promote, or exacerbate the disease 

30 or mediate immunity to it. To obtain confirmatory 
evidence of this possibility, test CD4"*" T cells from 
subjects expressing the relevant class II MHC molecules 
can be assayed for responsiveness to a peptide identified 
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in accordance with the invention. Control CDA* T cells can 
be from subjects also expressing the class II MHC molecule 
but without symptoms of the disease. A significant 
response of the test 004* T cells and no or little response 
5 of the control 004"*" T cells would indicate that the 

relevant peptide is involved either in the disease process 
(pathology of the disease) or in immunity to the disease. 
The cellular response phase of the method is designated 
"Epitope Verification" ("EV") . 

10 By applying the methods of the invention to the tumor 

antigen CEA, CEA-derived peptides were identified as 
epitopes that could be involved in the pathogenesis of 
cancer in human patients expressing the DR4 or DRl class 
II MHC allele. Based on their amino acid sequences, these 

15 peptides fall into 3 nested groups. A consensus peptide 
corresponding to the core region of each nested group can 
be synthesized and tested for its ability to activate CD4"*" 
T cells from either DR4, DRIS, or DRl-expressing cancer 
patients or DR4, DR13, or DRl-expressing subjects without 

20 disease symptoms. 

The methods of the invention can be applied to 
identifying peptides involved in the pathogenesis of or 
protection from any of a wide range of diseases, 
especially those in which relative susceptibility or 

25 resistance has been associated with expression of a 

particular class II MHC allele, provided that the amino 
acid sequence (or partial amino acid sequence) of a 
suspect polypeptide antigen is available. Candidate 
diseases include, without limitation, infectious diseases 

30 (e.g., diseases caused by Chlamydia trachomatisr 

Helicobacter pylori^ Neisseria meningitidis ^ Mycobacterium 
leprae^ Af. tuberculosis, Measles virus, hepatitis C virus, 
human immunodeficiency virus, and Plasmodium falciparium) , 
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cancer (e.g. melanoma, ovarian cancer, breast cancer, 
colon cancer and B cell lymphomas) [Topalian (1994), Curr. 
Opinion in Immunol, 6: 741-745; Topalian et al. (1996), J. 
Exp. Med. 183: 1965-1971], and autoimmune diseases (e.g., 
5 insulin dependent diabetes mellitus, rheumatoid arthritis, 
multiple sclerosis, myasthenia gravis, and systemic lupus 
erythmatosus ) . 

The invention also includes peptides derived from CEA 
using the above method, as described in Example 1. Also 

10 included in the invention are APL derived from the CEA 
peptides by replacing 1 to 6 (i.e., 1, 2, 3, 4, 5, or 6) 
amino acid residues of a naturally processed peptide that 
activates an immune response. Each residue is replaced 
with a different residue, resulting in a variant peptide 

15 that still binds to the same class II MHC allele, but 

elicits qualitatively different responses in 004* T cells 
than does the parent peptide from which the APL is 
derived- Thus, APL have the potential to be therapeutic 
and/or prophylactic in diseases in which the 004"*" T cell 

20 response to the relevant parent peptide is pathogenic. 

The invention features methods of therapy and prophylaxis 
involving the use of APL. The invention also features use 
of disease-related peptides in the diagnosis of disease or 
monitoring immune-based therapy. 

25 

1. Methods of Identifying CD4'*' T Cell Activating 

Peptide Epitopes Derived From Polypeptide Antigens 

The methods of the invention have two distinct 
30 phases. The first is termed "Immunological Mass 
Fingerprinting" (IMF) and the second "Epitope 
Verification" (EV) . The purpose of the IMF is to direct a 
candidate polypeptide to any one of the antigen processing 
compartments of an APC where it can be degraded to peptide 



- 21 - 



f 

wo 02/22803 PCT/USOl/28467 

fragments. Any peptides of the appropriate length (about 
9 to 25 amino acid residues), and having specific binding 
affinity for a particular class II MHC molecule expressed 
by the APC, will bind to that class II MHC molecule in the 
5 antigen processing compartments. At least some of these 
peptide-class II MHC molecular complexes then migrate to 
the cell membrane of the APC. The complexes (both cell- 
membrane associated and intracellular) are isolated from 
the APC and the peptides eluted from the complexes. The 

10 eluted peptides are then separated, their amino acid 

sequences determined, and the sequences compared to that 
of the candidate polypeptide. 

The IMF can generally be applied to the analysis of 
peptides produced by an APC expressing defined class II 

15 MHC molecules. As such, the method can be useful for 

basic research studies, e.g., studies aimed at identifying 
amino acid residues in a polypeptide that determine sites 
of "cutting" by the proteolytic antigen processing enzymes 
of APC. Alternatively, where the polypeptide is suspected 

20 of being an antigen that activates CD4'*' T cells that cause 
or promote a particular disease or mediate protection from 
a disease, the IMF can be used to identify disease-related 
or protective peptide epitopes derived from the 
polypeptide. This information would be useful for basic 

25 research into the etiology of the disease, or as a basis 
for development of diagnostics, therapeutics, or vaccines 
for the disease. 

A peptide whose amino acid sequence matches that of a 
region of the candidate polypeptide is likely to be one 

30 that activates CD4* T cells involved in the pathogenesis of 
or immunity to the relevant disease. Such a peptide can 
be subjected to the EV procedure in which its ability to 
activate CD4^ T cells from test and control subjects is 
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assayed. Those peptides that activate CD4'*" T cells from 
test subjects but not those from control subjects are 
identified as peptides that can initiate, promote, or 
exacerbate the relevant disease or mediate protection from 
5 disease or its pathogenic symptoms. 

Once such a peptide is identified, it can be 
synthesized in large amounts, by chemical or recombinant 
techniques, and used in diagnostic assays similar to the 
EV procedures listed below. Relevant peptides could be 

10 used singly or in combination. Alternatively, expression 
vectors encoding such a peptide or a combination of such 
peptides can be used to transfect or transduce appropriate 
APC (see below), and these can be used in similar 
diagnostic assays- 

15 Furthermore, multimers (e.g., dimers, trimers, 

tetramers, pentamers, or hexamers) of a class II MHC 
molecule complexed with a peptide defined by the method of 
the invention and conjugated to a detectable label (e.g., 
a fluorescent moiety, a radionuclide, or an enzyme that 

20 catalyzes a reaction resulting in a product that absorbs 
or emits light of a defined wavelength) can be used to 
quantify T cells from a subject (e.g., a human patient) 
bearing cell surface receptors that are specific for such 
complexes. Relatively high numbers of such T cells are 

25 likely to be diagnostic of a relevant disease or an 

indication that the T cells are involved in immunity to 
the disease. In addition, continuous monitoring of a 
patient's relative numbers of multimer-binding T cells can 
be useful in tracking the course of a disease or the 

30 efficacy of therapy. Such assays have been developed 

using tetramers of class I MHC molecules complexed with an 
HIV-l-derived or an influenza virus-derived peptide 
[Altman et al. (1996), Science 274:94-96; Ogg et al. 
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(1998), Science 279:2103-2106], and corresponding class II 
MHC multimers would be expected to be similarly useful. 
Such complexes could be produced by chemical cross-linking 
of purified class II MHC molecules assembled in the 
5 presence of a peptide of interest or by modification of 
already established recombinant techniques for the 
production of class II MHC molecules containing a single 
defined peptide [Kazono et al, (1994), Nature 369:151-154; 
Gauthier et al, (1998), Proc. Natl. Acad. Sci. U,S.A- 

10 95:11828-11833]. The class II MHC molecule monomers of 
such multimers can be native molecules composed of full- 
length a and p chains. Alternatively, they can be 
molecules containing only the extracellular domains of the 
a and P chains or the a and p chain domains that form the 

15 "walls" and "floor" of the peptide-binding cleft. 

1.1 IMF 

In IMF, the APC is transfected (transiently or 
stably) with the cDNA encoding the PPI (e.g., CEA) using 

20 standard methods known in the art, and high expressing 
clonal populations are isolated for preparative growth. 
As used herein, the teirm "transfected" includes infected. 
Thus, the cDNA encoding the PPI can be in an expression 
vector such as a plasmid or an infectious agent such as a 

25 virus. Useful viruses include vaccinia virus, measles 
virus, alpha viruses, and herpes viruses- Additional 
suitable viruses are listed below in the section 
describing APL. Infection of the APC can be performed in 
vitro or in vivo in a mammalian subject prior to 

30 harvesting of the cells from the subject. After producing 
a sufficient number of cells (~1 x 10^ to 1 x 10^° 
depending on the expression of the target tumor antigen) , 
the class II MHC molecules of interest are isolated from 
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the APC by any one of various methods known in the art, 
e.g., immunoprecipitation . They may be isolated by 
affinity chromatography by the method of Gorga et al. 
(1987), J- Biol. Chem. 262:16087-16094. 

5 Peptides bound non-covalently to the isolated 

class II MHC molecules are then eluted from the latter. A 
variety of methods known in the art can be used, for 
example, the method of Chicz et al. (1992), Nature 
358:764-768; Chicz and Urban, Immunology Today 15:155-160; 

10 Urban et al. Critical Reviews in Immunology 17:387-397 or 
the novel solid phase extraction protocol as in Example 1. 

The eluted peptides are separated by one of a variety 
of possible chromatographic methods, e.g., reverse phase 
chromatography. All the resulting fractions that contain 

15 peptides are then individually analyzed by MALDI-TOF mass 
spectrometry, using settings that do not fragment the 
peptides. The peptides corresponding to all the "peaks" 
obtained on .the MALDI-TOF spectrum can then be subjected 
to individual amino acid sequence analysis. 

20 Alternatively, only novel responses in the ^'test" cell- 
line can be subjected to amino acid sequence analysis. 
Novel responses are peaks that are not observed in a 
control spectrum generated using a sample of peptides 
obtained by an identical procedure but omitting the 

25 transfection of cells with a plasmid that contains a DNA 
sequence that encodes the PPI . The sequences of the 
individual peptides can be obtained by means known to 
those in the art. They can, for example, be obtained by 
MALDI-TOF, using instrument settings resulting in the 

30 fragmentation of the peptides into small fragments that 
are analyzed by the mass spectrometer. The amino acid 
sequences of the peptides are then compared to that of the 
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PPI. Those with a sequence identical to a region of the 
PPI are candidates for EV, 

Alternatively, other approaches for deriving the 
amino acid sequences of individual peptides can be 
5 utilized. These methods invoke a second dimension of 
peptide separation prior to mass spectrometric analysis. 
This is often achieved by coupling a separation technique 
such as reversed phase HPLC to a mass analyzer (such as 
but not limited to a quadrupole ion trap, triple 

10 quadrupole instrument, magnetic sector, Fourier 

transformation cyclotron resonance, quadropole time-of- 
f light, or a hybrid of these analyzers) though an 
elect rospray interface. First, novel peptide responses 
that were detected in the ^^test" transfected cell-line but 

15 not the control can be specifically isolated by the mass 
spectrometer and fragmented to yield amino acid sequences. 
Second, fractions can be analyzed in a data dependent mode 
of operation. In this mode, mass peaks are dynamically 
and automatically selected for isolation and fragmentation 

20 to yield amino acid sequences. No prior knowledge of 
novel signals is required for this mode of peptide 
sequencing. 

1.2 EV 

25 The EV procedure involves testing of peptides 

identified by IMF for their ability to (a) bind the 
class II MHC from which they were eluted and (b) activate 
various 004*^ T cell populations. Peptides with amino acid 
sequences either identical to those identified by IMF or 

30 corresponding to a core sequence derived from a nested 
group of peptides identified by the IMF are synthesized. 
The synthetic peptides are then tested for their ability 
to bind the class II MHC from which they were eluted and 
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activate T cells from (a) test subjects expressing the 

class II MHC molecule of interest and having at least one 
symptom of the disease; and (b) control subjects 
expressing the class II MHC molecule of interest and 
5 having no symptoms of the disease. Additional control 

subjects can be those with symptoms of the disease and not 
expressing the class II MHC molecule of interest. In some 
diseases (e.g., those with an autoimmune component) , 
responsiveness in the CD4'*' T cells of test subjects but not 

10 in CD4* T cells of the control subjects described in (b) 
provides confirmatory evidence that the relevant peptide 
is an epitope that activates CD4* T cells that can 
initiate, promote, or exacerbate the relevant disease. In 
other diseases (e.g., cancer or infectious diseases 

15 without an autoimmune component), a similar pattern of 

responsiveness and non-respo'nsiveness to that described in 
the previous sentence would indicate that the relevant 
peptide is an epitope that activates CD4* T cells that can 
mediate immunity to the disease or, at least, a decrease 

20 in the symptoms of the disease. 

Absence of a response in subjects with symptoms of 
the disease but not expressing the class II MHC molecule 
provides further evidence for the stated activities of the 
peptide. On the other hand, a response in such control 

25 CD4* T cells would not necessarily exclude such a role for 
the peptide but would suggest that the relevant peptide is 
capable of (i) binding to some MHC class II molecule 
expressed by the relevant subject; and (ii) being 
recognized by CD4'^ T cells in association with that class 

30 II MHC molecule. If these characteristics are confirmed 
for multiple MHC class II alleles, then the peptide is 
likely to have widespread utility in patients expressing 
different class II alleles. 
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CD4* T cell responses can be measured by a variety of 
in vitro methods known in the art. For example, whole 
peripheral blood mononuclear cells (PBMC) can be cultured 
with and without a candidate synthetic peptide and their 
5 proliferative responses measured by, e.g., incorporation 
of [^H] -thymidine into their DNA. That the proliferating 
T cells are CDi* T cells can be tested by either 
eliminating CD4* T cells from the PBMC prior to assay or by ^ 
adding inhibitory antibodies that bind to the CD4* molecule 

10 on the T cells, thereby inhibiting proliferation of the 
latter. In both cases, the proliferative response will be 
inhibited only if CD4* T cells are the proliferating cells. 

Alternatively, CD4'*" T cells can be purified from PBMC 
and tested for proliferative responses to the peptides in 

15 the presence of APC expressing the appropriate class II 
MHC molecule. Such APC can be B~lymphocytes, monocytes, 
macrophages, or dendritic cells, or whole PBMC. APC can 
also be immortalized cell lines derived from B- 
lymphocytes, monocytes, macrophages, or dendritic cells. 

20 The APC can endogenously express the class II MHC molecule 
of interest or they can express transfected 
polynucleotides encoding such molecules. Where the 
subjects are humans, the APC can also be T cells since 
human T cells are capable of expressing class II MHC 

25 molecules- In all cases the APC can, prior to the assay, 
be rendered non-proliferative by treatment with, e.g., 
ionizing radiation or mitomycin-C. 

As an alternative to measuring cell proliferation, 
cytokine production by the CD4'*' T cells can be measured by 

30 procedures known to those in art. Cytokines include, 

without limitation, interleukin-2 (IL-2) , IFN-y, IL-4, IL- 
5, TNF-a, interleukin-3 (IL-3) , interleukin-6 (IL-6) , 
interleukin-10 (IL-10), interleukin-12 (IL-12) , GM-CSF, 
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RANTES, MlP-la, MIP-ip and transforming growth factor p 
(TGFP) . Assays to measure them include, without 
limitation, ELISA, ELISPOT and bio-assays in which cells 
responsive to the relevant cytokine are tested for 
5 responsiveness (e.g., proliferation) in the presence of a 
test sample. Alternatively, cytokine production by CD4"*" 
lymphocytes can be directly visualized by intracellular 
immunofluorescence staining and flow cytometry. 

Once peptide epitopes associated with a particular 

10 disease have been identified, the EV described above can 
be used as a diagnostic test for the disease. Thus, 
lymphocytes from a subject suspected of having or being 
susceptible to the disease can be tested by any of the 
described methods for a CDA^ T lymphocyte response to one 

15 or more (e.g., 2, 3, 4, 5, 6, 10, 15, or 20) appropriate 
peptides. If a significant 004"*" T lymphocyte is detected, 
it is likely that the subject has or will develop the 
disease- The disease can be, for example, cancer and the 
peptides can be derived from, for example, CEA. 

20 Appropriate peptides can be, for example, any of those 

listed below (e.g., those with SEQ ID N0S:l-7 and 13-15). 

As an alternative to the above-described EV, peptides 
identified by the IMF can be tested for their ability to 
bind to an appropriate class II MHC molecule by methods 

25 known in the art using, for example, isolated class II MHC 
molecules or cells transfected with nucleic acid molecules 
encoding class II MHC molecules. One such method is 
described in Example 2. These binding assays can also be 
used to test the ability of the peptides to bind to 

30 alternative class II MHC molecules, i.e., class II MHC 
molecules other than those from which they were eluted 
using the IMF method of the invention. Once such 
alternative class II MHC molecules are shown to bind the 
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peptide (s), the diagnostic methods of the invention (using 
such peptides) and therapeutic methods of the invention 
(using either the peptides or APL derived from them) can 
be applied to subjects expressing such alternative class 
5 II MHC molecules. 

1.3 Diseases and their associations with class II 
MHC genes 

The methods of the invention can be applied to the 
analysis of peptides involved in diseases associated with 

10 expression of defined class II MHC molecules and in which 
pathology or protection is due to the action of activated 
CD4* T cells. Such diseases include, without limitation, 
certain infectious diseases, cancer, and autoimmune 
diseases- Three are discussed below. 

15 An example of an infectious disease that fulfills the 

above criteria is hximan leprosy, which is caused by 
Mycobacterium leprae. The bacteria infect and thrive in 
peripheral Schwann cells and macrophages. The disease is 
characterized by depressed cellular immunity but normal 

20 antibody responses. Leprosy has been associated with the 
expression of DRBl class II MHC molecules in which codon 
13 encodes Arg or codons 70 and 71 encode Arg [Zerva et 
al. (1996), J. Exp. Med. 183: 829-836]. 

The ability to spontaneously clear hepatitis C virus 

25 is associated with expression of DQB1*0301 molecules. 

Since DQB1*0302 is under-represented in hepatitis virus C 
infected subjects, DQB1*0302 expressing individuals may be 
protected from infection with the virus [Cramp et al. 
(1998), J. Hepatol. 29:207-213]. 

30 Melanoma cell-specific CD4''' T cells, which may be • 

involved in protective immune responses to malignant 
melanoma, recognize tyrosinase epitopes presented by HLA- 
DRB1*0401 class II molecules [Topalian et al. (1996), 
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supra] • With respect to cancers, the reticular cell 
sarcomas of SJL mice are dependent for growth on cytokines 
produced by activated 004"*" T cells and require the 
expression of certain class II MHC molecules. 

5 Examples of autoimmune diseases to which the methods 

of the invention can be applied include, without 
limitation, IDDM, rheumatoid arthritis (RA) , multiple 
sclerosis (MS), systemic lupus erythmatosus (SLE) , and 
myasthenia gravis (MG) . RA is associated with expression 

10 of DRBl alleles (DRB1*0101, 0401, 0403, and 0405)- MS is 
associated with expression of DRB1*1501, DQA1*0102 and 
DQB1*0602 alleles. SLE is associated with the expression 
of DRB1*03, DRB1*1501, DQA1*0501 and DQB1*0201 alleles. 
MG is associated. with the expression of DR3 and DQ2 

15 (DQA1*0501-DQB1*0201 and DQA1*0201-DQB1*0201) alleles. 
Autoimmune ovarian failure is associated with DQBl genes 
encoding Asp at position 57. Graves' thyroiditis, 
Hashimoto's thyroiditis, and primary hypothyroidism all 
show weak association with the expression of the DR5 and 

20 DR3 alleles. Coeliac disease is associated with the 
expression of HLA-DQA1*0501 and DQB1*0201 alleles. 
Primary biliary cirrhosis is associated with the 
expression of DRB1*0801-DQA1*0401/0601-DQB1*04 alleles. 
Autoimmune hepatitis is associated with the expression of 

25 DRB3*0101 and DRB1*0401 alleles. Addison's disease is 
associated with the expression of DRB1*03, DQA1*0501 and 
DQB1*0201 alleles. Vitiligo is associated with the 
expression of DRB1*0701 and DQ2 alleles. Anti-glomerular 
basement membrane disease (Goodpasture's syndrome) is 

30 associated with the expression of DR15 and DR4 alleles. 

Pathology in RA, MS, and IDDM is considered to be due 
predominantly to 004"^ T cell-dependent, cell-mediated 
autoimmune responses, while that of SLE and MG is due 
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predominantly to CD4* T cell-dependent, antibody-mediated 
autoimmune responses. In RA the inflammatory response 
induced by the activated CD4* T cells is focused on joint 
synovia, in MS on neural myelin sheaths, and in IDDM on 
5 pancreatic P cells located in the islets of Langerhans. 
SLE is a systemic autoimmune disease involving multiple 
organs. The muscle fatigue observed in MG is due to the 
development in the patient of antibodies that bind to the 
acetylcholine receptor in neuromuscular junctions. 
10 Other MHC class-II associated diseases are listed 

above . 

1.4 Species 

The methods of the invention can be applied to 
15 diseases with the described characteristics in a wide 
range of mammalian species, e.g., humans, non-human 
primates, horses, cattle, pigs, sheep, goats, dogs, cats, 
rabbits, guinea pigs, hamsters, rats, and mice. They will 
preferably be applied to diseases of humans. 

20 

1-5 Class II MHC Molecules 

Class II MHC molecules have been identified in 
multiple mammalian species. In some of these species, 
expression of a particular class II MHC molecule has been 
25 associated with a particular CD4* T cell-mediated diseases 
(see above) . In humans, for example, the class II MHC 
molecules are designated HLA-DR, HLA-DQ, and HLA-DP and in 
mice, H-2A and H-2E. In all species, there are multiple 
alleles of each gene. 

30 

1.6 Antigen Presenting Cells (APC) 

APC that can be used for the IMF methods of the 
invention can be any mammalian cell that expresses class 
II MHC molecules on its surface, e.g., those listed above 
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for use in EV (B lymphocytes, macrophages, monocytes, 
dendritic cells, and, in humans, T cells) . APC can also 
be tumor cells, e.g., B cell lymphoma cells or melanoma 
cells. It is also not required that the cells 
5 constitutively express class II MHC molecules. Class II 
MHC can be induced (in vitro or in vivo) (e.g., by IFN-y) 
in such cells. Alternatively, immortalized lines of such 
cells can be used. 

10 1.7 Polypeptide Antigens 

Polypeptide antigens that can be used with the IMF 
methods can be those with a known amino acid sequence or 
those in which at least part of the amino acid sequence is 
known. They can be polypeptides that themselves are known 

15 or suspected to be involved in the disease process or 

immunity to the disease (e.g., CEA in cancer) or they can 
be derived from microbial organisms known or suspected to 
be involved in the disease process (e.g., M, leprae in 
leprosy) . Examples of other polypeptide antigens include 

20 the core and viral coat proteins of viruses such as 
hepatitis C virus, the heat shock proteins of 
mycobacteria, and tyrosinase in melanoma- Furthermore, 
the polypeptide antigen can be the full-length protein or 
it can be a fragment of the protein known or suspected to 

25 be involved in the disease process. 

2. Peptides 

Peptides of the invention include, but are not 
limited to, peptides that bind to class II MHC molecules 
30 and activate CD4'*' T cells involved in a disease process or 
protection from a disease. The class II MHC molecule can 
be a class II MHC molecule that is associated with 
susceptibility or resistance to a disease. Diseases can 
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be any of the diseases cited herein and the species from 
which* the class II MHC molecules and/or peptides are 
obtained can be any of those cited herein. The class II 
MHC molecules are preferably human class II HLA molecules r 
5 i.e., DR, DP or DQ molecules- The peptides can be, for 
example, peptides that bind to DR4 or DRl molecules. The 
polypeptides from which the peptides of the invention are 
derived can be any of those cited herein. The peptides 
generally are 9 to 30 (e.g., 13 to 25) amino acids in 
10 length - 

The peptides can be derived, for example, from CEA 
and can bind to HLA-DRl or HLA-DR4 molecules. The 
peptides can be, for example, any one of the following 
peptides : 

15 KEVLLLVHNLPQH (SEQ ID N0:1); KEVLLLVHNLPQHL (SEQ ID NO: 2;) 
. KEVLLLVHNLPQHLF (SEQ ID NO: 3); KEVLLLVHNLPQHL FG (SEQ ID 
NO: 4); GKEVLLLVHNLPQHL (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG 
(SEQ ID N0:6); YLWWVNGQSLPVSPR (SEQ ID N0:7); 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13); NPPAQYSWLIDGNIQQH (SEQ ID 

20 NO: 14); or NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). Also 
included are peptides containing any of the above 
sequences, plus 1-15 residues on either end or 1-15 
residues on each end. 

In addition, the peptides can be isolated fragments 

25 of CEA which are shorter than full-length CEA. In such 

peptides, the fragments can contain one or more (e.g., one 
two, three, four, five, six, seven, eight, nine, or ten) 
amino acid sequences which can be, for example, those 
amino acid sequences with SEQ ID N0S:l-7 and 13-15. 

30 The peptides can also be peptides that include one or 

more CEA segments (e.g., SEQ ID N0S:l-7 and 13-15) and one 
or more (e.g., one, two, three, four, five, six, seven, 
eight, nine, ten, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 



- 34 - 



wo 02/22803 



PCT/USOl/28467 



21, 22, 23, 24, 25, 30, 35, 40, 50, 55, 60, 65, 70, 75, 
80, 90, 100, 150, 200, 250, 300, 400, 500, or more) amino 
acids at the C-terminus, the N-terminus, or both the C- 
terminus and the N-terminus of the CEA segment. The 
5 additional amino acid residues can be CEA or non-CEA amino 
acids. The peptide will, however, not be identical to 
full-length CEA. 

In addition, peptides identified as being associated 
with any of the diseases listed herein (e.g., cancer such 

10 as colon cancer or any of the infectious diseases recited 
herein) can be used to activate lymphocytes (e.g., CD4+ 
lymphocytes) that cause, directly or indirectly, the death 
of pathogenic target cells such as cancer cells or 
pathogen-infected cells. Alternatively, in other diseases 

15 such as the autoimmune diseases recited herein, the 

peptides can be used to induce immunological tolerance in 
lymphocytes (e.g., CD4+ T lymphocytes) associated with the 
initiation, progress, or pathological symptoms of the 
disease. Tolerization of these lymphocytes can be useful 

20 for prophylaxis against and/or therapy of the relevant 
disease. Lymphocyte activation or tolerization can be 
achieved by administering an appropriate peptide to a 
subject. Methods of testing for efficacy of a peptide in 
activating or in inducing tolerance in lymphocytes (e.g., 

25 CD4+ lymphocytes), methods and routes of administration, 
and doses to be administered are essentially the same as 
those described below for APL. Furthermore, the peptides 
can be modified in any of the ways described below for 
APL. In addition, peptidomimetic forms of the peptides 

30 can be produced by methods known in the art (see below in 
section on APL) . The peptides can be fragments of any the 
polypeptides disclosed herein, e.g., CEA, PSA, insulin, 
proinsulin, preproinsulin, GAD65, IA-2, or phogrin. They 
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can be, for exan^le, those with SEQ ID N0S:l-7 and 13-15 
or any of those described above. 

The invention also includes a method of activating or 
tolerizing CD4^ T cells in which full-length CEA or DNA 
5 encoding full-length CEA is administered to a mammal. 

Following this administration, CD4'*" T cells of the mammal 
are tested for responsiveness to a naturally processed 
fragment of CEA. Such a fragment can be, but is not 
limited to, a fragment with any of SEQ ID N0S:l-7 and 13- 
10 15. 

The CEA peptides of the invention (e.g.,. those with 
SEQ ID N0S:l-7 and 13-15) can be used for purposes other 
than therapy. They can be used, for example, in 
diagnostic assays and for methods of screening for 

15 reagents that bind to complexes of class II MHC molecules 
and CEA peptides (see below) . 

The peptides can be prepared using the described IMF 
methodologies. Smaller peptides (fewer than about 50 
amino acids long) can also be conveniently synthesized by 

20 standard chemical means. In addition, both polypeptides 
and peptides can be produced by standard in vitro 
recombinant DNA techniques, and in vivo transgenesis using 
the nucleotide sequences encoding the appropriate 
polypeptides, peptides or APL. Methods well known to 

25 those skilled in the art can be used to construct 

expression vectors containing relevant coding sequences 
and appropriate transcriptional/translational control 
signals- See , for example, the techniques described in 
Sambrook et al.. Molecular Cloning: A Laboratory Manual, 

30 Cold Spring Harbor Laboratory, N.Y., 1989, and Ausubel et 
al.. Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N.Y., 1989. 
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The invention also features isolated nucleic acid 
molecules encoding the peptides of the invention. These 
nucleic acid molecules can be cDNA, genomic DNA, synthetic 
DNA, or RNA, and can be double-stranded or single-stranded 
5 {i.e., either a sense or an antisense strand). Segments 
of these molecules are also considered within the scope of 
the invention, and can be produced, for example, by the 
polymerase chain reaction (PGR) or generated by treatment 
with one or more restriction endonucleases . A ribonucleic 

10 acid (RNA) molecule can be produced by in vitro 

transcription. Preferably, the nucleic acid molecules 
encode peptides that, regardless of length, are soluble 
under normal physiological conditions. 

The nucleic acid molecules of the invention can 

15 contain naturally occurring sequences, or sequences that 
differ from those that occur naturally, but, due to the 
degeneracy of the genetic code, encode the same 
polypeptide (for example, the peptides with SEQ ID N0S:l-7 
and 13-15) . In addition, these nucleic acid molecules are 

20 not limited to coding sequences, e.g., they can include 
some or all of the non-coding sequences that lie upstream 
or downstream from a coding sequence. 

The nucleic acid molecules of the invention can be 
synthesized (for example, by phosphoramidite-based 

25 synthesis) or obtained from a biological cell, such as the 
cell of a mammal. Thus, the nucleic acids can be those of 
a human, non-human primate (e.g., monkey), mouse, rat, 
guinea pig, cow, sheep, horse, pig, rabbit, dog, or cat. 
In addition, the isolated nucleic acid molecules of 

30 the invention encompass segments that are not found as 
such in the natural state. Thus, the invention 
encompasses recombinant nucleic acid molecules (for 
example, isolated nucleic acid molecules encoding any of 
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the peptides described herein) incorporated into a vector 
(for example, a plasmid or viral vector) or into the 
genome of a heterologous cell (or the genome of a 
homologous cell, at a position other than the natural 
5 chromosomal location) . 

Techniques associated with detection or regulation of 
genes are well known to skilled artisans and such 
techniques can be used to diagnose and/or treat disorders 
associated with aberrant CEA expression, e.g., colon 

10 cancer- Hybridization can be used as a measure of 
homology between two nucleic acid sequences. Thus a 
nucleic acid encoding a peptide of the invention (e.g., a 
CEA peptide such as a peptide with SEQ ID NOs: 1-7 or IS- 
IS) , or a portion of such a nucleic acid, can be used as 

15 hybridization probe according to standard hybridization 
techniques. The hybridization of a CEA peptide probe to 
DNA or RNA from a test source (e.g., a mammalian cell) is 
an indication of the presence of CEA DNA or RNA, 
respectively, in the test source. Hybridization 

20 conditions are known to those skilled in the art and can 
be found in Current Protocols in Molecular Biology, John 
Wiley & Sons, N.Y., 6.3.1-6.3.6, 1991. Moderate 
hybridization conditions are defined as equivalent to 
hybridization in 2X sodium chloride/sodium citrate (SSC) 

25 at 30°C, followed by one or more washes in 1 X SSC, 0,1% 

SDS at 50-60°C. Highly stringent conditions are defined as 
equivalent to hybridization in 6X sodium chloride/sodium 
citrate (SSC) at 45°C, followed by one or more washes in 
0.2 X SSC, 0.1% SDS at 50-65°C. 

30 The invention also encompasses: (a) vectors that 

contain any of the foregoing peptide coding sequences 
and/or their complements (that is, "antisense" sequence); 
(b) expression vectors that contain any of the foregoing 
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peptide coding sequences operatively associated with any 
transcriptional/translatiorial regulatory element (examples 
of which are given below in the section on APL) necessary 
to direct expression of the coding sequences; 
5 (c) expression vectors containing, in addition to 

sequences encoding a peptide of the invention, nucleic 
acid sequences that are unrelated to nucleic acid 
sequences encoding the peptide of the invention, such as 
nucleic acid sequences encoding a reporter, marker, or a 
10 signal peptide (e.g., a heterologous signal peptide); and 
(d) genetically engineered host cells that contain any of 
the foregoing expression vectors and thereby express the 
nucleic acid molecules of the invention. 

Where the nucleic acids form part of a hybrid gene 
15 encoding additional polypeptide sequences, for example, 
sequences that function as a marker or reporter, markers 
or reporter genes include p-lactamase, chloramphenicol 
acetyltransferase (CAT), adenosine deaminase (ADA), 
aminoglycoside phosphotransferase (neo^, 6418"^) , 
20 dihydrofolate reductase (DHFR), hygromycin-B- 

phosphotransf erase (HPH) , thymidine kinase (TK) , lacZ 
(encoding p-galactosidase) , and xanthine guanine 
phosphoribosyltransferase (XGPRT) • As with many of the 
standard procedures associated with the practice of the 
25 invention, skilled artisans will be aware of additional 
useful reagents, for example, additional sequences that 
can serve the function of a marker or reporter. 
Generally, the hybrid polypeptide will include a first 
portion and a second portion, the first portion being a 
30 peptide of the invention (e.g., a peptide with any of SEQ 
ID N0S:l-7 or 13-15) and the second portion being, for 
example, an immunoglobulin constant region. 
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A variety of host-expression vector systems can be 
used to express the peptides and polypeptides. Such host- 
expression systems represent vehicles by which the 
polypeptides of interest can be produced and subsequently 

5 purified, but also represent cells that can, when 

transformed or transfected with the appropriate nucleotide 
coding sequences, produce the relevant peptide or 
polypeptide in situ. These include, but are not limited 
to, microorganisms such as bacteria, e.g., E. coli or 

10 B. subtilis, transformed with recombinant bacteriophage 
DNA, plasmid or cosmid DNA expression vectors containing 
polypeptide coding sequences; yeast, e.g., Saccharomyces 
or Pichia, transfonned with recombinant yeast expression 
vectors containing the appropriate coding sequences; 

15 insect cell systems infected with recombinant virus 

expression vectors, e.g., baculovirus; plant cell systems 
infected with recombinant virus expression vectors, e.g., 
cauliflower mosaic virus (CaMV) or tobacco mosaic virus 
(TMV), or transformed with recombinant plasmid expression 

20 vectors, e.g., Ti plasmids, containing the appropriate 
coding sequences; or mammalian cell systems, e.g., COS, 
CHO, BHK, 293 or 3T3, harboring recombinant expression 
constructs containing promoters derived from the genome of 
mammalian cells, e.g., metallothionein promoter, or from 

25 mammalian viruses, e.g., the adenovirus late promoter or 
the vaccinia virus 7.5K promoter. 

3, APL 

An altered peptide ligand (APL) is a variant peptide 
30 in which 1-6 (i.e., 1, 2, 3, 4, 5, or 6) amino acid 

residues of a parent wild-type peptide that activates a 
response in CDA^ T cells have been changed. In an APL of 
the invention, fewer than half of the residues of the 



- 40 - 



wo 02/22803 



PCT/USOl/28467 



wild-type peptide are changed, e.g., fewer than 50%, fewer 
than 40%, fewer than 30%, fewer than 25%, fewer than 20%, 
fewer than 15%, fewer than 10%, or fewer than 5%. Thus, 
for example, in an APL derived from a wild-type peptide 
5 20 amino acids long and differing from the wild- type 
peptide at 6 positions, 30% of the amino acids of the 
wild- type peptide are changed. Alternatively, in an APL 
derived from a wild-type peptide 15 amino acids long and 
differing from the wild-type peptide at 3 positions, 20% 

10 of the amino acids of the wild- type peptide are changed. 

An APL retains at least some ability to bind to the 
class II MHC molecule to which the parent peptide binds 
and at least some ability to be recognized by the antigen- 
specific T lymphocyte receptor (s) of the T cell(s) 

15 that recognize the parent peptide bound to the same type 
of class II MHC molecule. However, by definition, the APL 
activates a response in the 004**" T cells that is 
qualitatively different from that activated by the parent 
peptide. For example, while the parent peptide can 

20 activate a helper T cell 1- (Thl-)type response in which 
the cytokines interleukin-2 {IL-2) , interferon-y (IFN-y), 
and tumor necrosis factor-a (TNF-a) are produced by the 
activated CD4^ T cells, an APL derived from this parent 
peptide might instead activate a helper T cell 2- (Th2-) 

25 type response in the CD4'*" T cells. In a Th2 response, the 
cytokines interleukin-4 (IL-4) , interleukin-5 (IL-5) , and 
interleukin-10 (IL-10) are produced by the activated CD4**' 
T cells. Alternatively, if a particular parent peptide 
elicits a Th2 response in a given CD4"*' T cell, an APL 

30 derived from the parent peptide could activate a Thl 
response in the T cell. Some APL have been shown to 
switch a Thl response to a ThO response in which both Thl- 
and Th2-type cytokines are produced. Furthermore, an APL 
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could redirect a CD4**" T cell response towards a Th3-type 
response in which the predominant cytokine produced is 
transforming growth factor-p (TGF-p) . TGF-p has been 
shown. to be suppressive of a wide range of immune 
5 responses. 

In general^ Thl responses are associated with cell- 
mediated immune responses and Th2 responses are associated 
with antibody- (i.e., B cell-) mediated immune responses. 
Thus, the relative number of 004**" T cells responding in a 
10 Thl- versus a Th2-type fashion will determine the nature 
(cell-mediated versus antibody -mediated) of the immune 
response generated by an antigen in a particular 
individual . 

Some conditions, and in particular autoimmune 
15 diseases (e.g., RA, IDDM, and MS), have been shown to be 
due to cellular immune responses and thus to be dependent 
on Thl 004**" T cell responses. Other diseases (e.g., MG and 
SLB) have been shown to be mediated by antibody (i-e., B- 
cell) responses, and thus to be dependent on Th2 004"^ T 
20 cell responses. Thus, an APL that serves to direct a 004"^ 
T cell response from a Thl to a Th2 response can be useful 
in treatment or prevention of the first category of 
diseases and an APL that serves to alter a CD4* T cell 
response in a Th2 to Thl direction can be useful in the 
25 treatment or prevention of the second category of 
diseases . 

The amino acid substitutions in APL can be radical. 
For example, an amino acid with a positively charged side 
chain (e.g., lysine) can be replaced by an amino acid with 
30 a negatively charged side chain (e.g., aspartic acid) or a 
hydrophobic side chain (e.g., isoleucine) and vice versa. 
In addition, an amino acid with a bulky side chain (e.g., 
tryptophan) can be replaced with an amino acid with small 
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side chain (e.g,, glycine or alanine) and vice versa. 
Alternatively, the substitutions can be conservative. For 
example/ a negatively charged amino acid can be replaced 
with another negatively charged amino acid (e.g., aspartic 

5 acid with glutamic acid) or one hydrophobic amino acid 
with another hydrophobic amino acid (e.g., leucine with 
valine or isoleucine) . 

Methods to test whether a given APL elicits a 
predominantly Thl, Th2, Th3, ThO, or other 004^^ T cell 

10 response are known in the art- In brief , an APL of 

interest can be administered to a test subject (e.g., a 
mouse) expressing a class II MHC molecule of interest 
(e.g., a human class II MHC molecule) by any one of a 
variety of routes, e.g., intramuscular, intravenous, 

15 subcutaneous, intradermal, intraperitoneal, intrarectal, 
intravaginal, intranasal, intragastric, intratracheal, or 
intrapulmonary. In addition, administration can be oral 
or transdeirmal, employing a penetrant such as a bile salt, 
a fusidic acid or another detergent. The injections can 

20 be single or multiple (e.g., 2-, 3-, 4-, 6-, 8-, or 10- 
fold) . The peptide can be administered in a 
physiologically acceptable solution (e.g., a saline 
solution) and can be administered with or without an 
adjuvant (e.g., Freund's complete or incomplete adjuvant 

25 or cholera toxin) • After immunization, the animal can be 
challenged with the APL, either in vivo or in vitro, by 
methods known in the art, and the levels of individual 
cytokines produced measured. In the case of an in vivo 
challenge, the cytokine secreted into the blood or some 

30 other bodily fluid (e.g., urine, saliva, or semen) or 
lavage (e.g., nasal, pulmonary, rectal, gastric, or 
vaginal lavages) can be measured. Alternatively, lymphoid 
cells can be isolated from the animal after challenge and 
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the level of cytokines produced by the cells can be 
tested, e.g., by culturing and measurement of cytokine 
levels in culture supernatant by, e.g., ELISA. Isolated 
lymphoid cells can also be tested for relative numbers of 
5 cells producing the cytokines by assays such as the 
ELISPOT assay or fluorescence analysis following 
intracellular staining with one or more cytokine binding 
antibodies, each conjugated with a different fluorophore 
which emits light of a distinct wavelength. Fluorophores 

10 fluorescing at different wavelengths (i.e., colors) are 
known in the art. Using such fluorescence assays, it is 
possible to ascertain the range of cytokines being 
produced by a single cell. If the lymphoid cells are 
challenged in vitro, assays such as the ELISPOT assay or 

15 ELISA can be used. Should immunization and challenge with 
an APL result in relatively low levels of IL-2, IFN-y, and 
TNF-a and relatively high levels of IL-4, IL-5, and IL-10, 
while immunization and challenge with the parental peptide 
results in the inverse pattern, the conclusion would be 

20 that the APL is useful for switching a response from a Thl 
to a Th2 pattern of cytokine production. Where the Thl 
response is pathogenic, treatment with the APL can be 
therapeutic or prophylactic. Similarly, APL and their 
parent peptides can be tested for their relative abilities 

25 to shift a response from a Th2 type response to a Thl type 
response, or from a Thl-type response to a ThO- or a Th3- 
type response. The ability of an APL to activate 
responses such as CD4'*' CTL responses, anti-angiogenic 
responses, or macrophage/eosinophil activating responses 

30 can be determined by methods familiar to those in the art. 
APL can also be tested by any of the protocols in 
human volunteers. Alternatively, lymphoid cells could be 
isolated from a subject (e.g., a human subject) and both 
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immunized and challenged in vitro. In addition, APL can 
be administered to "SCID-Hu" mice, which are mice 
genetically deficient in murine T and B lymphocytes and 
reconstituted with human lymphoid cells. Due to the 
5 inherent immunological deficiency in these animals, the 
human lymphoid cells are not rejected and will engraft. 
After immunizing and challenging these mice with an APL 
(using any of methodologies described above) , their 
cytokine responses can be measured by any of the methods 

10 described above. To ensure that the cytokines detected in 
the assays are of human origin, hioman species-specific 
reagents (e.g., antibodies) can be used for the assays 
(e.g., ELISA or ELISPOT) . Furthermore, in order to 
exclude presentation of the APL to human CD4* T cells by 

15 murine class II MHC molecules, SCID mice could be bred 
with class II MHC "knockout" mice in order to generate 
mice deficient in both lymphocytes and murine class II MHC 
molecules. By reconstituting the resulting mice with 
human lymphoid cells, a SCID-Hu mouse is provided in which 

20 essentially the only CD4* T cells capable of responding are 
human CD4^ T cells and the only class II MHC molecules 
capable of presenting the APL are human class II MHC 
molecules on the surface of human lymphoid cells. 
Alternatively, the recipient of human lymphoid cells could 

25 be a hybrid mouse derived by breeding SCID mice with DR 
(e.g., DR4) or DQ (e.g., DQ8) transgenic mice made on a 
class II MHC knockout background. Again the only class II 
MHC molecules present would be the human DR or DQ 
molecules contributed by the transgenic parental mice. 

30 RAG-1 deficient mice can be used instead of the SCID 

mice for generation of the described human/mouse chimeric 
animals, RAG-1 deficient mice, like SCID mice, lack T and 
B lymphocytes but have the advantage that the relevant 
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mutation is not "leaky." Thus, while late in life SCID 
mice can develop a low number of lymphocytes, this does 
not occur in RAG-1 deficient mice. 

The APL of the invention can be obtained by any of 

5 the methods described above for peptides and polypeptides. 
APL of the invention also include those described above, 
but modified for In vivo use by the addition, at either or 
both the amino- and carboxyl-terminal ends, of a blocking 
agent to facilitate survival of the relevant peptide in 

10 vivo. This can be useful in those situations in which the 
peptide termini tend to be degraded by proteases in vivo. 

Alternatively, blocking agents such as pyroglutamic 
acid or other molecules known in the art can be attached 
to the amino and/or carboxyl terminal residues, or the 

15 amino group at the amino terminus or carboxyl group at the 
carboxyl terminus can be replaced with a different moiety. 
Likewise, the peptides can be covalently or noncovalently 
coupled to pharmaceutically acceptable "carrier" proteins 
prior. to administration. 

20 Also of interest are peptidomimetic compounds that 

are designed based upon the amino acid sequences of the 
APL. Peptidomimetic compounds are synthetic, non-peptide 
compounds having a three-dimensional conformation (i.e., a 
"peptide motif") that is substantially the same as the 

25 three-dimensional conformation of a selected peptide. The 
peptide motif provides the peptidomimetic compound with 
the ability to activate CD4* T cells in a manner 
qualitatively identical to that of the APL from which the 
peptidomimetic was derived. Peptidomimetic compounds can 

30 have additional characteristics that enhance their 

therapeutic utility, such as increased cell permeability 
and prolonged biological half-life. 
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The peptidomimetics typically have a backbone that is 
partially or completely non-peptide, but with side groups 
that are identical to the side groups of the amino acid 
residues that occur in the peptide on which the 

5 peptidomimetic is based. Several types of chemical bonds, 
e.g., ester, thioester, thioamide, retroamide, reduced 
carbonyl, dimethylene and ketomethylene bonds, are known 
in the art to be generally useful substitutes for peptide 
bonds in the construction of protease-resistant 

10 peptidomimetics . 

4. Methods of Therapy Using APL 

An APL that has the ability to elicit a cytokine 

response in 004"*" T cells that is non-pathogenic and/or is 

15 suppressive of a pathogenic CD4* T cell cytokine response 

elicited by the APL's parental peptide, could be useful in 

therapy, palliation, or prophylaxis of a disease caused by 

the pathogenic CD4'*' T lymphocyte response to the parental 

peptide. 

20 For example, if "peptide x". elicits a potent Thl-type 

response in 004"*" T cells in a patient with a HLA-DR4, 
treatment of the patient with a peptide x-derived APL that 
elicits a Th2 T cell response can be therapeutic or 

palliative in that patient. Alternately, if peptide x 

25 elicits a Thl response, an APL derived from peptide x 
could elicit a CD4'*' T-cell response that enhances (a) 
migration of CD8*^ tumoricidal CTL to the vicinity of the 
tumor cells for identification and specific killing of the 
tumor cells; or (b) migration of tumoricidal macrophages 

30 and eosinophils to the vicinity of the tumor cells and 
thereby enhance killing of the tumor cells . 
Alternatively, an APL could elicit, an anti-angiogeneic 
response in the microenvironment of the tvmior that cuts 
off the nutrient supply needed for continued tumor cell 
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division and consequent tumor growth. Furthermore, an APL 
derived from a tumor antigen peptide could activate a CD4"*' 
CTL response more effectively than the parent peptide from 
which the APL was derived. 
5 These methods of the invention fall into 2 basic 

classes, i.e., those using in vivo approaches and those 
using ex vivo approaches. 

4.1 In Vivo Approaches 

10 In one in vivo approach, the APL (peptide or 

peptidomimetic) itself is administered to the subject by 
any of the routes listed above. It is preferably 
delivered directly to an appropriate lymphoid tissue (e.g. 
spleen, lymph node, or mucosal-associated lymphoid tissue 

15 (MALT) ) . 

The dosage required depends on the choice of APL, 
the route of administration, the nature of the 
formulation, the nature of the patient's illness, and the 
judgment of the attending physician. Suitable dosages are 

20 in the range of 0.1-100.0 fig/kg. Wide variations in the 
needed dosage are to be expected in view of the variety of 
APL available and the differing efficiencies of various 
routes of administration. For example, oral 
administration would be expected to require higher dosages 

25 than administration by i.v. injection. Variations in 
these dosage levels can be adjusted using standard 
empirical routines for optimization, as is well understood 
in the art. 

Alternatively, a polynucleotide containing a 
30 "minigene" encoding the APL can be delivered to an 
appropriate cell of the animal. Expression of the 
minigene will preferably be directed to lymphoid tissue of 
the subject by, for example, delivery of the 
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polynucleotide to the lymphoid tissue. This can be 
achieved by, for example, the use of a polymeric, 
biodegradable microsphere, microparticle or microcapsule 
delivery vehicle, sized to optimize phagocytosis by 

5 phagocytic cells such as macrophages. For example, PLGA 
(poly-lacto-co-glycolide) microparticles approximately 1- 
10 (jun in diameter can be used. The polynucleotide is 
encapsulated in these microparticles, which are taken up 
by macrophages and gradually biodegraded within the cell, 

10 thereby releasing the polynucleotide. Once released, the 
DNA is expressed within the cell. A second type of 
microparticle is intended not to be taken up directly by 
cells, but rather to serve primarily as a slow-release 
reservoir of nucleic acid that is taken up by cells only 

15 upon release from the micro-particle through 

biodegradation. These polymeric particles should 
therefore be large enough to preclude phagocytosis (i.e., 
larger than Spun and preferably larger than 20pm) . 
Microparticles useful for nucleic acid delivery, methods 

20 for making them, and methods of use are described in 

greater detail in U.S. Patent No. 5,783,567, incorporated 
herein by reference in its entirety. Microparticles may 
also be made, for example, by the methods of Mathiowitz et 
al, (WO 95/2429), incorporated herein by reference in its 

25 entirety. 

Alternatively, a polynucleotide containing a 
"minigene" encoding the APL can be delivered in a 
pharmaceutically acceptable carrier such as saline, 
lipids, liposomes, particulates, virus-like particles, or 

30 nanospheres; as colloidal suspensions; or as powders. The 
nucleic acid can be naked or associated or complexed with 
a delivery vehicle- For a description of the use of naked 
DNA, see, e.g., U.S. Patent No. 5,693,622. Nucleic acids 
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and polypeptides can be delivered using delivery vehicles 
known in the art, such as ISCOMS, microspheres, 
microcapsules, microparticles, gold particles, virus--like 
particles, nanoparticles, polymers, condensing agents, 

5 polysaccharides, polyamino acids, dendrimers, saponins, 
adsorption enhancing materials, or fatty acids. Viral 
particles can also be used, e.g., retroviruses, 
adenovirus, adeno-associated virus, pox viruses, SV40 
virus, alpha virus or herpes viruses. 

10 The vectors can be incorporated alone into these 

delivery vehicles or co-incorporated with tissue-specific 
antibodies- Alternatively, one can prepare a molecular 
conjugate composed of a plasmid or other vector attached 
to poly-L-lysine by electrostatic or covalent forces. 

15 Poly-L-lysine binds to a ligand that can bind to a 

receptor on target cells [Cristiano et al. (1995), J. Mol. 
Med. 73:479]. Alternatively, lymphoid tissue specific 
targeting can be achieved by the use of lymphoid tissue- 
specific transcriptional regulatory elements (TRE) such as 

20 a B lymphocyte, T lymphocyte, or, optimally, dendritic 

cell specific TRE. Lymphoid tissue specific TRE are known 
[Thompson et al. (1992), Mol, Cell. Biol. 12:1043-1053; 
Todd et al. (1993), J. Exp. Med. 177:1663-1674; Penix et 
al. (1993), J. Exp. Med. 178:1483-1496]. 

25 The nucleic acid sequence encoding an APL of interest 

with an initiator methionine and optionally a trafficking 
sequence is operatively linked to a promoter or enhancer- 
promoter combination in the expression vector. 

Short amino acid sequences can act as signals to 

30 target proteins to specific intracellular compartments. 
For example, hydrophobic signal peptides 
(e.g., MAISGVPVLGFFIIAVLMSAQESWA, SEQ ID NO: 8) are 
typically found at the amino terminus of proteins destined 
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for the ER. The sequence KFERQ (SEQ ID NO: 9) (and other 
closely related sequences) is known to target 
intracellular polypeptides to lysosomes, while other 
sequences 

5 (e.g., MDDQRDLISNNEQLP, SEQ ID N0:10) target polypeptides 
to endosomes. In addition, the peptide sequence KDEL (SEQ 
ID NO: 11) has been shown to act as a retention signal for 
the ER. Each of these signal peptides, or a combination 
thereof, can be used to traffic the APL of the invention 

10 as desired. For example, a construct encoding a given APL 
linked to an ER-targeting signal peptide would direct the 
peptide to the ER, where it would bind to the class II MHC 
molecule as it is assembled, preventing the binding of 
intact invariant chain (li) which is essential for 

15 trafficking. Alternatively, a construct can be made in 

which an ER retention signal on the APL would help prevent 
the class II MHC molecule from ever leaving the ER. If 
instead an APL of the invention is targeted to the 
endosomic compartment, this would ensure that large 

20 quantities of the APL are present at the site of class II 
MHC to peptide complexing, thereby increasing the 
likelihood that the peptide incorporated into the class II 
MHC complex is the APL of the invention rather than a 
naturally-occurring, irrelevant peptide. The likelihood 

25 of APL being available for incorporation into class II MHC 
can be increased by linking the APL to an intact li 
polypeptide sequence- Since li is known to traffic 
class II MHC molecules to the endosomes, the hybrid li 
would carry one or more copies of the APL along with the 

30 class II MHC molecule; once in the endosome, the hybrid li 
would be degraded by normal endosomal processes to yield 
both multiple copies of the APL (or peptides similar to 
it), and an open class II MHC peptide binding cleft. DNAs 
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encoding APL linked to targeting signals can be generated 
by PGR or other standard genetic engineering or synthetic 
techniques. Trafficking sequences are described in 
greater detail in U.S. Patent No. 5,827,516 incorporated 

5 herein by reference in its entirety. 

A promoter is a TRE composed of a region of a DNA 
molecule, typically located within 100 nucleotide pairs 
upstream of the point at which transcription starts. 
Enhancers provide expression specificity in terms of time, 

10 location, and level. Unlike a promoter, an enhancer can 
function when located at variable distances from the 
transcription initiation site, provided a promoter is 
present. An enhancer can also be located downstream of 
the transcription initiation site. The coding sequence of 

15 the expression vector is operatively linked to a 
transcription terminating region. 

Suitable expression vectors include plasmids and 
viral vectors such as herpes viruses, retroviruses, 
vaccinia viruses, attenuated vaccinia viruses, canary pox 

20 viruses, adenoviruses and adeno-associated viruses, among 
others . 

Polynucleotides can be administered in a 
pharmaceutically acceptable carrier. Pharmaceutically 
acceptable carriers are biologically compatible vehicles 

25 which are suitable for administration to a human, e.g., 
physiological saline. A therapeutically effective amount 
is an. amount of the polynucleotide which is capable of 
producing a medically desirable result in a treated 
animal. As is well known in the medical arts, the dosage 

30 for any one patient depends upon many factors, including 
the patient's size, body surface area, age, the particular 
compound to be administered, sex, time and route of 
administration, general health, and other drugs being 
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administered concurrently. Dosages will vary, but a 
preferred dosage for administration of polynucleotide is 
from approximately 10^ to 10^^ copies of the polynucleotide 
molecule. This dose can be repeatedly administered, as 
5 needed. Routes of administration can be any of those 
listed above. 

4.2 Ex Vivo Approaches 

In one ex vivo approach, lymphoid cells, including 

10 CD4'*' T lymphocytes, are isolated from the subject and 
exposed to the APL in vitro. The lymphoid cells can be 
exposed once or multiply (e.g., 2, 3, 4, 6, 8, or 10 
times) . The pattern of cytokine production by the 
lymphoid cells can be tested after one or more exposures. 

15 Once the desired cytokines are being produced by the 

lymphoid cells, they are reintroduced into the subject via 
any of the routes listed herein. The therapeutic or 
prophylactic efficacy of this ex vivo approach is 
dependent on the ability of the ex vivo APL-activated 

20 lymphocytes to actively suppress a pathogenic CD4'*' T cell 
response to the parental wild-type peptide. The potential 
value of such an approach is indicated by experiments in 
which CD4*** T cells producing Th2- (or ThO- or Th3-)type 
cytokines actively suppressed ongoing Thl responses and 

25 disease caused by such Thl responses [Nicholson and 
Kuchroo (1996), Curr. Opinion in Immunol, 8:837-842]. 

An alternative ex vivo strategy can involve 
transfecting or transducing cells obtained from the 
subject with a polynucleotide containing the APL-encoding 

30 minigenes described above. The transfected or transduced 
cells are then returned to the subject. While such cells 
would preferably be lymphoid cells, they could also be any 
of a wide range of types including, without limitation. 
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fibroblasts, bone marrow cells, macrophages, monocytes, 
dendritic cells, epithelial cells, endothelial cells, 
keratinocytes, or muscle cells in which they act as a 
source of the APL for as long as they survive in the 
5 subject. The use of lymphoid cells would be particularly 
advantageous in that such cells would be expected to home 
to lymphoid tissue (e.g., lymph nodes or spleen), and thus 
the APL would be produced in high concentration at the 
site where they exert their effect, i.e., activation of an 

10 immune response. By using this approach, active in vivo 
immunization with the APL is achieved. The same genetic 
constructs and trafficking sequences described for the in 
vivo approach can be used for this ex vivo strategy. 

The ex vivo methods include the steps of harvesting 

15 cells from a subject, culturing the cells, transducing 

them with an expression vector, and maintaining the cells 
under conditions suitable for expression of the APL. 
These methods are known in the art of molecular biology. 
The transduction step is accomplished by any standard 

20 means used for ex vivo gene therapy, including calcium 
phosphate, lipofection, electroporation, viral infection, 
and biolistic gene transfer. Alternatively, liposomes or 
polymeric microparticles can be used. Cells that have 
been successfully transduced are then selected, for 

25 example, for expression of the minigene or of a drug 
resistance gene. The cells may then be lethally 
irradiated (if desired) and injected or implanted into the 
patient. 

These methods of the invention can be applied to any 
30 of the diseases and species listed here. Methods to test 
whether an APL is therapeutic for or prophylactic against 
a particular disease can be simple modifications of the 
above-described methods for establishing the type of CD4**" 
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T lymphocyte response elicited by a particular APL. Where 
a therapeutic, effect is being tested, a test population 
displaying symptoms of the disease {e.g., cancer patients) 
is treated with a test APL, using any of the above 

5 described strategies. A control population, also 

displaying symptoms of the disease, is treated, using the 
same methodology, with a placebo. Disappearance or a 
decrease of the disease symptoms in the test subject would 
indicate that the APL was an effective therapeutic agent. 

10 By applying the same strategies to subjects prior to 

onset of disease symptoms (e.g., experimental animals in 
which an appropriate disease can be deliberately induced, 
e.g., tumor model), APL can be tested for efficacy as 
prophylactic agents, i.e., vaccines. In this situation, 

15 prevention of onset of disease symptoms is tested. 

The following examples are meant to illustrate, not 
limit, the invention. 

Example 1. Identification of HLA Class II Binding CEA 

20 Peptide Epitopes 

Materials and Methods 

Epstein-Barr Virus (EBV)- Transformed B cell Lines 
expressing CEA encoding cDNA . EBV- trans formed B 
lymphocyte lines, JY and A2.140, were propagated in 

25 RPMI 1640 medium supplemented with glutamine, 

penicillin/streptomycin and 10% fetal calf serum (FCS) . 
CEA-encoding cDNA was inserted into the expression vector 
pcDNA3.1-Zeo (Invitrogen, Carlsbad, CA) at the EcoR V and 
Xba I restriction sites to give pcDNA3 . 1-CEA. pcDNAS.l- 

30 Zeo has a CMV promoter and a zeomycin selection marker. 
The sequence of the CEA-encoding cDNA (SEQ ID NO: 12) 
inserted into the vector is shown in Fig. lA and the amino 
acid sequence (SEQ ID NO: 17) of CEA encoded by the cDNA is 
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Shown in Fig. IB. pcDNA3,l-CEA and pcDNA3.1 (a negative 
control vector which was pcDNA3.1-Zeo without a protein- 
encoding insert) were linearized and both transfected by 
electroporation into JY and A2.140 cells. Stably 
5 transfected cells were selected with Zeonycin (50 ]xg/ml; 
sold as Zeocin™) and the transfected cells were cloned by 
limiting dilution. All surviving clones were then tested 
for CEA expression using the murine mAb COL-1 (specific 
for human CEA) by western blot and FACS analysis. Three 

10 high-expressing CEA clones were selected for further 
analysis (JY-CEA-9, A2. 140-CEA-15 and A2 , 140-CEA-38) . 
Fig. 2 shows the cell surface expression of CEA on JY-CEA- 
9 cells as detected by FACS analysis. 

Culture of EBV-Trans formed B Cell Lines . EBV- 

15 transformed B lymphocyte lines were propagated in 
RPMI 1640 medium supplemented with glut amine, 
penicillin/streptomycin and 10% fetal calf seriam (FCS) in 
2 L roller bottles to a density of -10^ cells/mL. JY cells 
and transfectants produced using them (e.g., JY-CEA-9) 

20 have the HLA-DRB*0401/HLA-DRB1*1301 genotype. A2.140 and 
transfectants produced using them (e.g., A2 , 140-CEA-15 and 
A2.140-CEA-38) have the HLA-DRB1*0101 genotype. 

HLA class II purification . Cells were harvested and 
25 pelleted by centrifugation. The cell pellets were weighed 
to determine the cellular mass and then frozen at -SO^C 
prior to lysis. Each cell pellet was resuspended in lysis 
buffer (2 ml per gram of pellet) containing 1% CHAPS, 500 
mM NaCl, 20 mM Tris-OH pH 8.0, in Milli-Q™* reverse osmosis 
30 quality (about 18.2 ]xQ) water containing freshly added 
protease inhibitors (100 nM iodoacet amide, 8 [xg/ml 
aprotinin, 10 |xg/mL leupeptin, 10 |xg/mL pepstatin A, 5 mM 
EDTA, 0.04 % sodium azide, 1 mM PMSF) , and the cells were 
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lysed by gentle agitation for 1 hr at 4°C on a rotor 
table. The resulting cell lysate was sedimented by ultra- 
centrifuge for 1 hr at 230,000 x g at 4^C {37,500 RPM on a 
SW41 TI rotor) . The insoluble material was removed by 

5 sedimentation at 175,000 x g for 2 hours, and the soluble 
supernatant fraction used for subsequent HLA purification. 
Multi-modal protein purification using HPLC columns was 
achieved by coupling the chromatographic sorbents in 
series with automated switching valves that direct the 

10 class II HLA-peptide complex containing effluent to 
subsequent columns in the sequences. The first three 
coupled columns were connected directly in series and 
acted together as a single pre-clearing column using high 
strength large throughpore perfusion sorbents (6000-8000 A 

15 throughpores and 500-1000 A diffusive pores, 50 jim) coated 
and crosslinked with a hydrophilic stationary phase and 
covalently conjugated with Protein A as the sorbent. 
These columns were designed to remove those proteins that 
non-specifically bind to the sorbents. Column 1 contained 

20 unmodified Protein A sorbent, column 2 contained Protein A 
coated with normal mouse serum, and column 3 was Protein A 
coated with bovine serum. The pre-clearing columns were 
followed by an immunoaf f inity column of Protein A coupled 
with mAb specific for a non-polymorphic determinant on 

25 HLA-DR molecules (A-LB3.1). After passing the lysate 
through the immunoaf f inity column, the column was 
extensively washed with 50 column volumes of 0.1 % 
CHAPS/500 mM NaCl/0.05 % sodium azide/20 mM Tris-OH pH 
8.0, followed by 50 column volumes of 0.1 % DOC/20 mM 

30 Mops/280 mM NaCl/0.05 % sodium azide pH 8.0, and finally 
100 column volumes of 0.1 % DOC/0,05 % sodium azide/10 mM 
Tris-OH pH 8.0. The HLA- DR-pep tide complexes were eluted 



- 57 - 



wo 02/22803 



PCT/USOl/28467 



from the immunoaf finity support using 3.5 column volumes 
of 50 mM carbonate/ 0.1% DOC/0.05% NaNa at pH 11. 5. 

Peptide Analysis 
5 The HLA class II protein samples can be concentrated 

by various methods. In one such method, the HLA class II 
protein sample eluted from the immunoactivity column was 
concentrated to 100 jxl using an ultrafiltration device 
(Amicon Centricon 10) prior to peptide extraction. 

10 Naturally processed peptide mixtures were acid eluted from 
HLA class II molecules by adding 800 ^1 10 % acetic acid 
and incubating for 15 minutes at 70 °C, as previously 
described [Chicz et al. (1993), J. Exp. Med. 178:24-47]. 
The peptides were separated from the remaining HLA protein 

15 by ultrafiltration with an Amicon Centricon lO'** device. 
The "flow-through" fraction containing the acid-extracted 
peptides was concentrated on a Savant SpeedVac™ 
centrifugal vacuum concentrator to a volume of 
approximately 20-30 \xl and stored at -80°C. The acid- 

20 extracted peptide mixtures were then separated by reverse 
phase chromatography as previously described [Chicz et al. 
(1993), supra] but with minor modifications. Briefly, 
peptide solutions were preconcentrated by trapping the 
peptides using a small bed (0.5 - 3.0 |iL bed volume) of 

25 polymeric reversed phase support. This also facilitates 
removal of hydrophilic contaminants that may be in the 
sample by washing the trap with a suitable aqueous 
solution (e.g., the buffers used for the chromatographic 
separation of the isolated peptides) . Subsequently, 

30 peptides were back-flushed from the trapping phase to a 
microbore C18 column (1.0 x 250 mm; Vydac, Hesperia, CA) , 
and peptide fractionation was performed at a flow rate of 
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50 |il/minute. The column effluent was collected and 
stored at -20°C prior to analysis by mass spectrometry. 

An alternative approach was to concentrate HLA 
class II molecule samples and extract the associated 

5 peptides using solid phase extraction (SPE) . In this 
method, the HLA class II molecule samples were 
concentrated using a protein capture column that contained 
a suitable HPLC stationary phase. Appropriate HPLC 
stationary phases include a reversed phase (preferably a 

10 C18 or a polymeric C18-like reversed phase) . However, a 
cation or anion exchange resin (including both strong, 
weak and mixed bed ion exchange phases) or other 
substrates that exhibit high affinity for protein 
complexes while enabling the elution of the HLA class II 

15 peptides, are equally suitable. Suitable dimensions of 
the protein capture column are 1-10 mm internal diameter 
and 2-10 cm long. Once the protein was bound to the 
capture column, the column was washed with suitable 
solvents (e.g., an aqueous buffer containing 5-25 mM Tris 

20 base at pH 7 • 5-8.5 followed by an aqueous solution of 

trifluroacetic acid (0.05-0.2% v/v) ) to remove hydrophilic 
contaminants. The HLA class II molecule-peptide complexes 
were next disrupted by the action of a suitable solvent 
that also facilitated the elution of peptides from the 

25 protein capture column. An appropriate solvent for this 
purpose is a mixture of acetonitrile (5-25% v/v), TFA 
(0.05-5% v/v) in water. The eiuted peptides were 
collected in a suitable container (e.g., an Eppendorf^ 
microfuge tube) and concentrated using a Savant Speedvac™ 

30 centrifugal vacuum concentrator. This step lowered the 
acetonitrile concentration and thereby permitted peptide 
fractionation by reverse phase chromatography (as 
described above) . All peptide samples were stored at - 
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20 °C prior to analysis by mass spectrometry. 
Alternatively, the eluted peptides were mixed post-elution 
from the protein capture column with an aqueous solution 
of TFA {typically 0.05-0.2% TFA in water) in order to 
5 lower the acetonitrile concentration and thereby permit 
peptide adsorption onto a peptide capture column. The 
peptide capture column contained an HPLC phase (e.g., a 
reversed phase resin such as C18 or polymeric equivalent 
and an ion exchange resin or other suitable phase that 

10 exhibits high affinity towards peptides) . Suitable 

dimensions of the peptide capture column are 0.5-4.6 mm 
internal diameter and 1-10 cm long. Following the 
adsorption of peptides, the peptide capture column was 
washed with a suitable solvent (e.g., 0-5% acetonitrile, 

15 0.05-0.2% TFA in water) prior to peptide fractionation by 
reversed phase chromatography (as described above) . All 
peptide samples were stored at -20 °C prior to analysis by 
mass spectrometry. 

Peptide samples were prepared for matrix assisted 

20 laser desorption time-of-f light (MALDI-TOF) mass 

spectrometry analysis by adding 1.0 iiL of each sample to a 
spot on the sample plate. A matrix solution (0.5 (xL of a 
solution of a-cyano-4-hydroxycinnamic acid (10 mg/ml) in 
50% acetonitrile/0.1 % trif luoroacetic acid) was added to 

25 the dry sample spots and allowed to air dry. The matrix 
solution also contained des-arg bradykinin and glu- 
fibrinopeptide B (300 fmol/p.L of each) of each as internal 
mass scale calibrants. Mass spectra were obtained at 
optimum laser intensities by averaging the ion signals 

30 from 50 to 256 individual scans in both linear and 
reflector modes using a single stage extended length 
reflector time-of-f light mass spectrometer (Voyager Elite 
XL; PerSeptive Biosystems, Framingham, MA) , 
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An automated microcapillary liquid chromatography- 
itiass spectroscopy (LC-MS) approach with either targeted or 
data dependent collision-assisted dissociation (CAD) was 
used to sequence low levels of naturally processed HLA 
5 associated peptides that were isolated by the MALDI-TOF-MS 
approach previously described. Peptide fractions 
separated by reversed phase chromatography were diluted to 
a final volume of 5-20 |iil to aid handling and permit' 
second dimension reversed phase separations. Each diluted 

10 peptide solution was then concentrated by trapping 
peptides in a small bed (0,5 - 1.0 p.L bed volume) of 
polymeric reversed phase support. This step also 
facilitated removal of hydrophilic contaminants by washing 
the trap with a suitable aqueous solution (e.g., mobile 

15 phase A as used in the chromatographic separation of 

isolated peptides) . The peptides were then back flushed 
from the trapping phase onto the microcapillary (with an 
inner diameter of 75 [m and packed with 3-10 cm of 1-7 |im 
100-200 A Ci8 or non-porous material) and separation was 

20 developed using a non-linear gradient of conventional 
mobile phases for peptide separations (typically 
combinations of water and acetonitrile containing a 
suitable ion pair reagent) . A mobile phase flow rate of 
0.15-0.20 )xL/min was achieved by splitting the flow from 

25 the pumps and using a backpressure regulator. Peptide 
detection was by |i-electrospray mass spectrometry. The 
voltage necessary to drive the electrospray was applied at 
the head of the microcapillary column and peptides were 
electrosprayed into the mass analyzer directly as they 

30 eluted from the column. CAD experiments were either 
predetermined to conduct specific target analyses or 
triggered in a data dependent mode, using ions that were 
more abundant than a user-set threshold. Dynamic 
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exclusion was used in conjunction with data dependent 
analyses to ensure maximum peptide coverage (i.e., minor 
responses were analyzed by CAD following a user-determined 
nvunber of CAD experiments of a single peptide response) by 
5 writing an exclusion list during assay progression so that 
a given ion will not be analyzed by multiple CAD 
experiments. The time that a given ion resides on the 
exclusion list was dependent upon the quality of the 
chromatographic separations. This time is determined 
10 experimentally. In this way, separated isobaric responses 
may be analyzed. Peptide sequencing sensitivities greater 
than 100 attomoles were achieved using this method. 

Results 

15 A cDNA sequence encoding the tumor antigen CEA {SEQ 

ID NO: 12) was cloned into a mammalian expression vector 
(pcDNA3.1-Zeo) and was transfected into and expressed in 
JY EBV- trans formed B lymphocytes and A2.140 EBV- 
transformed B lyii^hocytes using the methods described 

20 above. Transfectants were selected using Zeocin and the 
resulting stable transfectants were cloned by limiting 
dilution. Six control clones stably transfected with the 
pcDNA3.1-Zeo vector without an expressible insert 
("parental vector") and 43 CEA expressing transfectant 

25 clones were generated with JY cells. Six control clones 
stably transfected with the parental vector and 7 CEA 
expressing transfectant clones were generated with A2.140 
cells. All clones were then tested for CEA expression by 
western blot and FACS analysis using the murine mAb Col-1. 

30 Three CEA high- expressing clones (one from JY cells and 
two from A2.140 cells) were selected for further analysis 
(JY-CEA-9, A2.140-CEA-15 and A2 . 140-CEA-38) . Two control 
clones, one each from JY (clone JY-pcDNA3-lG9) and A2.140 
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cells- (clone A2. 140-pcDNA3) were also selected for peptide 
repertoire comparison. 

After growing the cells to a density of --10^ cells/mL, 
each preparation was harvested by centrifugation. JY- 
5 pcDNA3-lG9 was harvested from 10.8 L of medium to yield 
24.0 grains of cell pellet. JY-CEA-9 was harvested from 
10-8 L of medium to yield 23.6 grams of cell pellet. 
A2. 140-pcDNA3 was harvested from 20 L of medium to yield a 
41.0 gram pellet. A2. 140-CEA-15 was harvested from 14.4 L 
10 of medium to yield a 33.0 gram pellet. A2. 140-CEA-38 was 
harvested from 14.4 L of medium to yield a 30.2 gram 
pellet. 

Protein purification was accomplished by 
immunpaffinity chromatography as described above. HLA-DR 

15 molecules purified from JY are heterogenous and include 
both HLA-DR4 and HLA-DR13 molecules. The JY-pcDNA3-lG9 
HLA-DR yield from the Protein-A-LB3 . 1 column was 5.8 mg. 
The JY-CEA-9 HLA-DR yield from the Protein-A-LB3 . 1 column 
was 5.0 mg. The only HLA-DR molecules A2.140 cells 

20 express are HLA-DRl molecules. The A2 . 140-pcDNA3 HLA-DRl 
yield from the Protein-A-LB3 . 1 column was 12.0 mg. The 
A2.140-CEA-15 HLA-DRl yield from the Protein-A-LB3 . 1 
column was 12.5 mg. The A2.140-CEA-38 HLA-DRl yield from 
the Protein-A-LB3 . 1 column was 8.0 mg. 

25 Peptides were eluted from each HLA-DR protein 

preparation and separated by RP-HPLC, and each of the 
100 fractions collected was analyzed by MALDI-TOF. 
RP-HPLC analysis was highly reproducible, with 
chromatographic traces from the CEA-expressing and control 

30 CEA-non-expressing HLA-DR4/13 preparations showing 

similarity to each other. Similarly, the traces obtained 
with peptides from CEA-expressing and control CEA non- 
expressing HLA-DRl preparations showed similarity to each 
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other. The chromatographic traces of the peptide 
repertoires from HLA-DR4/13-expressing JY cells and HLA- 
DRl-expressing A2.140 cells were distinguishable from each 
other- A subtract ive approach was used to identify CEA- 

5 derived peptides. Mass spectra of equivalent RP-HPLC 
fractions from the A2. 140-pcDNA3 and A2.140-CEA-38 
peptides preparations were overlaid and masses common to 
both were discounted from further analysis. An example of 
such a profile is shown in Fig. 3. In Fig. 3, while peaks 

10 with m/z values of about 1704.8, 1805.9 and 1899.9 were 
seen in the spectra obtained with peptide mixtures from 
both A2.140-CEA-38 and A2 . 140-pcDNA3 cells, a peak with a 
m/z value of 1801.9 was seen only in the spectrum obtained 
with the peptide mixture from A2 . 140-CEA-38 cells. 

15 Similar MALDI-TOF analyses were performed for the peptide 
repertoires isolated from JY-CEA-9 and JY-pcDNA3-lG9 
control cell lines as well as A2 . 140-CEA-15 and A2.140- 
pcDNA3 cell line control. 

Of the approximately 40,000 m/z values observed, 

20 165 novel masses were initially identified as potential 
naturally processed peptides from CEA. Subsequent mass 
analyses using MS/MS fragmentation were performed on 278 
of the 500 RP-HPLC fractions as described. Approximately 
50,000 MS/MS experiments were performed in both data 

25 dependent and targeted mass analyses. Six CEA peptides 
from the JY-CEA-9 cell line (SEQ ID N0S:l-6) were 
identified and sequenced (Figs. 4-9) . One CEA-derived 
peptide was found in both the A2 . 140-CEA-15 and A2.140- 
CEA-38 cell lines (SEQ ID N0:7) (Fig, 10). 

30 Thus, the described IMF method applied to the 

analysis of peptides produced by natural processing of CEA 
identified seven peptides that are associated with HLA- 
DR4/13 and HLA-DRl. This knowledge provides the basis for 
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the development of therapeutic and/or prophylactic agents 
against CEA associated cancers^ e.g., colon cancer. It is 
expected that analogous methodologies can be similarly 
successful in identifying other class II MHC-restricted 
5 timor antigen peptides that activate T cells and are 

involved in the CD4"^ T lymphocyte-mediated pathogenesis of 
other diseases (see above) in which susceptibility is 
linked to the expression of a particular class II MHC 
molecule • 

10 Three additional peptides (SEQ ID NOS: 13-15) from 

the JY-CEA-9 cell line were subsequently identified and 
sequenced (Figs. 10-12), 

Epitope Verification (EV) ; 

15 To confirm that peptide epitopes identified are 

relevant to cancer (i.e., that they are recognized by 
T cells of patients with cancer) , T cell proliferation 
assays can be carried out using synthetic peptides having 
amino acid sequences based upon the sequence of the 

20 peptides identified by mass spectrometry to be derived 
from CEA (e.g., the peptides with SEQ ID NOS: 1-7 and IS- 
IS) . Peptides can be synthesized using Fmoc chemistry and 
purified by RP-HPLC. The amino acid sequences and purity 
of greater than 90% for all the synthetic peptides can be 

25 confirmed by MALDI-MS and analytical HPLC. Peripheral 

blood mononuclear cells from recent onset cancer patients 
(and healthy controls expressing the appropriate HLA-DRl 
and HLA-DR4 molecules) can be separated by density 
gradient centrifugation and co-cultured in wells of 96- 

30 well U-bottom plates with peptides at a concentration of 
10 |ig/ml for 5 days in 150 jil RPMI 1640/10% pooled normal 
AB serum, followed by pulsing with 0,5 (iCi [^H]- 
thymidine/well and harvesting onto filters for 



- 65 - 



wo 02/22803 



PCTAJSOl/28467 



radioactivity counting measured in counts per minute 
(cpm) . There are generally twelve replicate wells per 
test group. Results are expressed as a stimulation index 
(SI) which is the ratio of the counts per minute (cpm) 
5 obtained in samples from cultures containing peptide to 
the cpm obtained in samples from cultures without peptide 
(mean cpm of 12 wells in each case) . The data are also 
analyzed In terms of the fraction of "positive culture 
wells." A positive culture well is one that contained 
10 peptide and resulted in cpm > mean cpm + 2SD obtained from 
cultures without peptide. T cell responses are considered 
significant when the SI is >2-0 and >40% wells are 
positive. 

Data obtained from the above EV analysis are expected 
15 to confirm that the peptides identified by the IMF method 
are recognized specifically by CD4^ T lymphocytes from HLA- 
DRl and HLA-DR4 expressing cancer patients and thus may be 
implicated in immunity to the cancer. 

Example 2. Binding of consensus peptides to isolated HLA- 
DRl and HLA-DR4 molecules 

Synthetic peptides with amino acid sequences based on 
the 3 core regions identified by the IMF can be tested for 
their ability to bind to isolated HLA-DRl and HLA-DR4 
molecules in binding inhibition assays performed 
essentially as previously described [Chicz et al. (1997), 
J. Immunol. 159: 4935-4942]. In brief , aliquots of 
immunopurified preparations of HLA-DRl and HLA-DR4 (final 
concentration of 10 jig/ml) are incubated with a 
biotinylated HLA-DR specific binding peptide (consisting 
of residues 98-117 of class II MHC invariant chain) ("the 
indicator peptide") (1 pM) and varying concentrations of 
the test peptides in 0.2 ml tubes. After an overnight 
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incubation at room temperature, the contents of each tube 
are transferred to a well of a 96-well plastic microtiter 
plate precoated with anti-HLA-DR antibody. The microtiter 
plates are rocked for 60 min at room temperature and 
5 unbound material is removed by rigorous washing- The 
relative amount of bound standard peptide in each well 
will be determined by measuring color development after 
addition of streptavidin-conjugated alkaline phosphatase, 
washing, and adding a chromogenic alkaline phosphatase 
10 substrate. 

Other Embodiments 

The invention also features the following 
embodiments . 

15 Methods of Use 

The peptides or APL of the invention can be used to 
activate CD4"*" memory T cells specific for CBA by in vitro 
methods known to those in the art. After a patient 
undergoes surgery to remove a tumor, such CEA-specific CD4'*" 

20 memory T cells may be administered to establish tumor- 
specific, long-lasting immunity against tumor rechallenge. 
Topalian [Current Opinion in Immunology (1994), 6:741-745] 
reviews the role of anti-tumor 004"*" T cells in 
immunological memory. Studies have demonstrated that 

25 tumor-specific immunological memory can be established by 
vaccination" with IL-2-transduced tumor cells, and CD4* T 
cells are a* critical component of this memory response. 

The peptides or APL of the invention can also be used 
to help class I MHC restricted (CD8*) CTL responses. For 

30 example, CDS* T cells recognizing the gag-L-encoded CTL 
epitope have been demonstrated to be effector cells that 
are efficiently activated with help from peptide- 
activated, tumor-specific CD4* T cells [Ossendorp et al. 
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(1998), J. Exp. Med, 187:693-702]. In addition, mature, 
monocyte-derived dendritic cells were used to elicit 
resistance to malignant melanoma. [Thurner et al. (1999), 
J. Exp. Med. 190:1669-1678]. Intracutaneous injections of 
5 Mage-3A1 peptide-pulsed mature dendritic cells enchanced 
Mage-3Al-specific CDS* CTL responses and activated 004"^ T 
cell activity in stage IV melanoma patients with large 
tumor loads. 

004* T cells have been shown to play a role in the 

10 induction and persistence of CD8"** T cells. A peptide 

epitope h-gplOO (melanocyte differentiation antigen which 
is expressed by more than 75% of melanomas) that binds to 
several class II MHC molecules was identified and 
characterized. The MHC class Il-restricted epitope, which 

15 was capable of inducing tumor-reactive 004"*" cells, could 
potentiate antitumor effector function and long term 
immunity [Touloukian et al. (2000), J. Immunol. 164:3535- 
3542] . The peptide was able to induce specifically the in 
vitro expansion of CD4'*' T cells. Thus, the peptides or APL 

20 of the invention could be used as immunogens, together 
with one or more CDS* T cell epitopes produced by 
appropriate CEA-expressing tumor cells. Interestingly in 
this regard, the human Melan-A/MART-1 protein that 
includes an HLA-A2-restricted peptide epitope recognized 

25 by melanoma-reactive CDS**" CTLs also contains at least one 
HLA-DR4-presented peptide recognized by 004* T cells 
[Zarour et al. (2000), Proc. Natl. Acad. Sci. USA 97:400- 
405] . 

The peptides or APL of the invention can also be used 
30 to activate cytokine production for the recruitment of 
tumoricidal eosinophils and macrophages- Hung et al. 
[(1998), J. Exp. Med. 188:2357-2368] showed that 
vaccination with irradiated tumor cells transduced to 



- 68 - 



wo 02/22803 



PCT/USOl/28467 



secrete granulocyte/macrophage colony-stimulating factor 
resulted in activation of CD4"^ T cells which, by the action 
of the cytokines that they produce, activated eosinophils 
and macrophages to produce superoxide and nitric oxide, 
5 both which have tijraioricidal activity. It will be possible 
to screen for a peptide or APL of the invention that will 
stimulate 004"*" T cells with such activity. 

Qin et al. [(2000), Immunity 12:677-686] showed that 
in vivo CD4'*' T cell mediated immunity to tumors was due to 
10 inhibition of tumor-induced angiogenesis, was dependent on 
IFN-Yf and required expression of the receptor for IFN-y 
on non-hematopoietic cells. Peptides or APL of the 
invention can be useful in activating 004"^ T cells with 
this ant i- tumor activity. 

15 

Reagents that Bind to Peptide-Class II MHC Complexes 

Reagents that bind to peptide-class II MHC complexes 
can be made, for example, by screening a phage display 
library in which the phage particles contain nucleic acid 

20 sequences encoding antibody fragments such as Fab or 

single chain Fv (scFv) fragments. Such libraries can be 
screened by testing for the presence of and isolating 
phage particles with the ability to bind to the peptides 
of the invention (e.g., KEVLLLVHNLPQH (SEQ ID N0:1), 

25 KEVLLLVHNLPQHL (SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID 
NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL 
(SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID N0:6); 
YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGKEVLLLVHNLPQHL (SEQ ID 
NO: 13); NPPAQYSWLIDGNIQQH (SEQ ID NO: 14); or 

30 NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15)), bound to a class II 
MHC molecule of interest. For example, phage display 
technology has been used to identify antibodies specific 
for defined HIiA-DR2 peptide complexes associated with 
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multiple sclerosis [Krogsfaard et al. (2000), J. Exp. Med. 
191:1395-1412]. Alternatively, polyclonal antibodies or 
mAb can be screened for their ability to bind to peptide- 
class II MHC complexes of interest. 
5 The above antibody-based reagents can be used, for 

example, in diagnosing cancer. For example, binding to a 
test cell of a Fab, scFv, or an antibody (e.g., a mAb) 
specific for a HIiA-DR4 molecule bound to a CEA peptide of 
the invention would indicate that the test cell is a 

10 cancer cell. Examples of detection agents used to detect 
binding of antibodies or antibody fragments include, 
without limitation, enzymes, radiolabels, luminescent 
compounds, and fluorescing compounds that elicit a 
detectable and measurable signal when the antibody 

15 complexes with the CEA naturally processed peptide. 
Examples of detectors include, without limitation, 
spectrophotometers , colorimeters , f luorometer s , 
luminometers and biacore machines. 

Therapeutic agents for treating cancer can be made, 

20 for example, by linking one of the above reagents with a 
therapeutic (e.g., cytotoxic) atom or molecule. An 
appropriate therapeutic agent, after administration (by 
any of the methods disclosed herein) to a subject with the 
relevant cancer, binds to the cancer cells. The 

25 therapeutic atom or molecule can then kill the cancer 
cell- Alternatively, the therapeutic agent is 
internalized by the cancer cell and then the therapeutic 
atom or molecule kills the cancer cell. The linkage 
between the reagent and therapeutic atom or molecule can 

30 be a covalent one or a relatively weak non-covalent one 
such that the complex dissociates after binding to the 
cancer cell surface. Examples of therapeutic atoms and 
molecules include chemotherapeutic compounds. 
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radioisotopes, and toxins, e.g., ricin or diptheria toxin, 
or toxic fragments of such toxins. 

Although the invention has been described with 
reference to the presently preferred embodiments, it 
5 should be understood that various modifications can be 
made without departing from the spirit of the invention. 
Accordingly, the invention is limited only by the 
following claims. 

10 What is claimed is: 
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1. A method of identifying a class II MHC-binding 
fragment of a polypeptide, the method comprising: 

(a) providing a mammalian antigen-presenting cell 
(RFC) cell line transfected with a DNA encoding a 

5 polypeptide; 

(b) isolating from the APC a class II MHC molecule 
bound to a peptide, wherein the peptide is a class II MHC- 
binding fragment of the polypeptide; 

(c) eluting the peptide from the class II MHC 
10 molecule; and 

(d) identifying the peptide as a fragment of the 
polypeptide, 

2. The method of claim 1, wherein the polypeptide 
has the sequence of a tumor antigen - 

15 3. The method of claim 1, wherein the APC is a 

dendritic cell- 

4. The method of claim 1, wherein the APC is a 
macrophage or a monocyte. 

5- The method of claim 1, wherein the APC is a 
20 B lymphocyte. 

6. The method of claim 1, wherein the mammal is a 
human. 

7. The method of claim 2, wherein the class II MHC 
molecule is selected from the group consisting of a HLA-DR 

25 molecule, a HLA-DQ molecule, and a HLA-DP molecule. 

8. The method of claim 2, wherein the class II MHC 
molecule is a DR molecule with a p-chain encoded by a gene 
selected from the group consisting of DRB1*0401, 
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DRB1*0101, DRB1*0301, DRB1*0701, DRB1*1101, DRB1*1301, 
DRB1*1302, DRB1*1501 and DRB5*0101. 

9. An isolated peptide fewer than 35 amino acid 
residues in length, comprising a sequence KEVLLLVHNLPQH 

5 (SEQ ID N0:1) . 

10. The isolated peptide of claim 9, comprising an 
amino acid sequence selected from the group consisting of: 
KEVLLLVHNLPQHL (SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID 

NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL 
10 (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6) ; and 
EGKEVLLLVHNLPQHL (SEQ ID N0:13)- 

11. An isolated peptide fewer than 35 amino acid 
residues in length, comprising a sequence YLWWVNGQSLPVS.PR 
(SEQ ID N0:7) • 

15 12. An altered peptide ligand (APL) , the amino acid 

sequence of which is identical, except for 1-6 amino acid 
substitutions, to a fragment of carcinoembryonic antigen 
(CEA) , the fragment being fewer than 35 amino acids 
residues in length and comprising a sequence selected from 

20 the group consisting of KEVLLLVHNLPQH (SEQ ID N0:1), 

YLWWVNGQSLPVSPR (SEQ ID NO: 7), and NPPAQYSWLIDGNIQQH (SEQ 
ID NO: 14), wherein no more than 30% of the amino acid 
residues of the fragment are si±)stituted with different 
amino acid residues in the APL, and wherein the APL binds 

25 to a class II MHC molecule. 

13. The APL of claim 12, wherein the sequence of the 
fragment is selected from the group consisting of 
KEVLLLVHNLPQH (SEQ ID N0:1), YLWWVNGQSLPVSPR (SEQ ID 
N0:7), and NPPAQYSWLIDGNIQQH (SEQ ID N0:14). 
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14. A process for making an APL, the process 
comprising: 

(a) carrying out the method of claim 1, and 

(b) synthesizing an APL consisting of a sequence 

5 which is identical to that of the peptide, except having 
amino acid substitutions at 1, 2, 3, 4, 5, or 6 positions 
in the peptide. 

15. The process of claim 14, wherein the polypeptide 
is CEA. 

10 16. A method of activating T cell reactivity in a 

mammal, the method comprising: 

(a) providing (i) a peptide, the sequence of which 
consists of the sequence of a naturally processed fragment 
of CEA, wherein the peptide binds to a class II MHC 

15 molecule of the mammal and elicits a CD4"*" T cell response, 
or (ii) a DNA encoding a polypeptide selected from the 
group consisting of (1) the peptide, (2) the peptide plus 
an amino terminal methionine residue, and (3) either (1) 
or (2) linked to a trafficking sequence; and 

20 (b) administering the peptide or DNA to the mammal. 

17. The method of claim 16, wherein the peptide is 
fewer than 35 amino acid residues in length and comprises 
a sequence KEVLLLVHNLPQH (SEQ ID N0:1). 

18. The method of claim 17, wherein the peptide is 
25 KEVLLLVHNLPQH (SEQ ID N0:1). 

19. The method of claim 17, wherein the peptide is 
KEVLLLVHNLPQHL (SEQ ID NO: 2). 

20. The method of claim 17, wherein the peptide is 
KEVLLLVHNLPQHLF (SEQ ID NO: 3). 
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21. The method of claim 17, wherein the peptide is 
KEVLLLVHNLPQHLFG (SEQ ID NO: 4). 

22 The method of claim 11, wherein the peptide is 
GKEVLLLVHNLPQHL (SEQ ID NO: 5). 

5 23, The method of claim 17, wherein the peptide is 

EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6). 

24. The method of claim 16, wherein the peptide is 
fewer than 35 amino acid residues in length and comprises 
a sequence YLWWVNGQSLPVSPR (SEQ ID NO: 7). 

10 25. The method of claim 24, wherein the peptide is 

YLWWVNGQSLPVSPR (SEQ ID NO: 7), 

26. A method of altering a T cell response in a 
mammal, the method comprising: 

(a) providing (i) an APL having a sequence identical, 
except for amino acid substitutions at 1-6 positions, to 
the sequence of a naturally-processed fragment of CEA, 
wherein the APL binds to a class II MHC molecule of the 
mammal, or (ii) a DNA encoding a polypeptide selected from 
the group consisting of (1) the APL, (2) the APL plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence; and 

(b) administering the APL or DNA to the mammal. 

27. The method of claim 1, further comprising: 

(e) providing 004"^ lymphocytes from a mammal having a 
25 condition suspected of being associated with presentation 

of the peptide by the class II MHC molecule, wherein the 
APCs of the mammal bear the class II MHC molecule; 

(f ) providing a population of APCs that bear the 
class II MHC molecule with the peptide bound thereto; 
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(g) contacting the population of APCs of (f ) with the 
CD4"** lymphocytes of (e) ; and 

(h) determining whether the CD4"*" lymphocytes recognize 
the class II MHC-bound peptide^ as an indication that 

5 presentation of the peptide to 004**" T lymphocytes is 
associated with the condition. 

28- The method of claim 27, wherein said 
presentation is associated with a pathological response of 
CD4* T lymphocytes. 

10 29. The method of claim 27, wherein said 

presentation is associated with a protective response of 
004"** T lymphocytes. 

30- A method of diagnosis comprising: 

(a) providing a CD4^ lymphocyte from an individual 
15 suspected of having or being susceptible to cancer; 

(b) providing an APC which bears on its surface a 
class II MHC molecule of an allele identical to one 
expressed by said individual, wherein the class II MHC 
molecule is bound to a CEA peptide; 

20 (c) contacting the APC with the CD4'*' lymphocyte; and 

(d) determining whether the CD4*'* lymphocyte recognizes 

the class II MHC-bound peptide, as an indication that the 

individual has or is susceptible to cancer, 

wherein the peptide comprises an amino acid sequence 
25 selected from the group consisting of: KEVLLLVHNLPQH (SEQ 

ID N0:1); KEVLLLVHNLPQHL (SEQ ID NO: 2;) KEVLLLVHNLPQHLF 

(SEQ ID NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); 

GKEVLLLVHNLPQHL (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID 

NO: 6); YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGKEVLLLVHNLPQHL 
30 (SEQ ID NO: 13); NPPAQYSWLIDGNIQQH (SEQ ID NO: 14); and 

NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 
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31- A method of treating a subject suspected of 
having or being susceptible to cancer, the method 
comprising administering the peptide of claim 9 to the 
subject- 

5 32 • A method of identifying a reagent for diagnosing 

cancer, the method comprising: 

(a) providing a test reagent selected from the group 
consisting of a Fab fragment, a monoclonal antibody (mAb) , 
and a single chain Fv (scFv) fragment; 

10 (b) providing a complex comprising a class II MHC 

molecule bound to a peptide comprising a sequence selected 
from the group consisting of KEVLLLVHNLPQH (SEQ ID N0:1), 
KEVLLLVHNLPQHL (SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID 
NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL 

15 (SEQ ID N0:5); EGKEVLLLVHNLPQHLFG (SEQ ID N0:6); 

YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGKEVLLLVHNLPQHL (SEQ ID 
N0:13); NPPAQYSWLIDGNIQQH (SEQ ID N0:14); and 
NPPAQYSWLIDGNIQQRT (SEQ ID NO: 15); and 

(c) testing whether the test reagent binds to the 

20 complex, wherein binding to the complex is an indication 
that the test reagent is potentially useful for diagnosing 
cancer. 

33. The method of claim 30, wherein the class II MHC 
molecule is a DR molecule with a p-chain encoded by a 

25 DRB1*0401 gene, 

34. The method of claim 30, wherein the class II MHC 
molecule is a DR molecule with a p-chain encoded by a 
DRB1*0101 gene. 

35. A method of diagnosis, the method comprising: 
30 (a) providing a test cell from a mammalian subject; 
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(b) providing a reagent that binds to a CEA peptide 
fragment bound to a class II MHC molecule; 

(c) contacting the test cell with the reagent; and 

(d) detecting binding of the reagent to the test cell 
5 as an indication that the test cell is a cancer cell. 

36- A method of cancer therapy comprising: 

(a) providing a composition comprising a reagent 
selected from the group consisting of a Fab fragment, a 
mAb, and a scFv fragment, V7herein the reagent recognizes a 
naturally processed CEA peptide bound to a MHC class II 
molecule, the reagent being linked to an agent selected 
from the group consisting of a chemotherapeutic compound, 
a radioactive isotope and a toxin; and 

(b) administering the composition to a subject 
suspected of having or being susceptible to a cancer 
characterized by expression of CEA. 

37. A method of identifying a class II MHC-binding 
fragment of a tumor antigen, the method comprising: 

(a) providing a mammalian APC comprising a class II 
20 MHC molecule and the tumor antigen; 

(b) isolating from the APC the class II MHC molecule 
bound to a peptide, wherein the peptide is a class II MHC 
binding fragment of the tumor antigen; 

(c) eluting the peptide from the class II MHC 
25 molecule ; and 

(d) identifying the amino acid sequence of the 
peptide. 

38. The method of claim 37, further comprising: 

(e) providing CD4"*' lymphocytes from a mammal having a 
30 cancer suspected of being associated with presentation of 

the peptide by the class II MHC molecule, wherein the APCs 
of the mammal bear the class II MHC molecule; 
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(f) providing a population of APCs that bear the 
class II MHC- molecule with the peptide bound thereto; 

(g) contacting the population of APCs of (f) with the 
€04"^ lymphocytes of (e) ; and 

5 (h) determining whether the CD4^ lymphocytes recognize 

the class II MHC bound peptide, as an indication that 
presentation of the peptide to CD4* lymphocytes is 
associated with the cancer. 

39. An isolated DNA comprising a nucleotide sequence 
10 encoding a peptide fewer than 35 amino acids in length and 

comprising the sequence of SEQ ID N0:1, 7 or 14. 

40. A vector comprising the DNA of claim 39. 

41. The vector of claim 40, wherein the nucleotide 
sequence is operatively linked to a transcriptional 

15 regulatory element. 

42. A cell comprising the vector of claim 40, 

43. A cell comprising the vector of claim 41. 

44. An isolated peptide fewer than 35 amino acid 
residues in length, comprising a sequence 

20 NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 

45. The isolated peptide of claim 44, comprising a 
sequence NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 

46. The method of claim 17, wherein the peptide is 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13). 

25 47. The method of claim 16, wherein the peptide is 

fewer than 35 amino acid residues in length and comprises 
a sequence NPPAQYSWLIDGNIQQH (SEQ ID N0:14) . 
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48. The method of claim 47, wherein the peptide is 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 

49. The method of claim 47, wherein the peptide is 
NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 

5 50, The method of claim 2, wherein the tiimor antigen 

is CEA. 

51. The method of claim 2, wherein the tiomor antigen 
is selected from the group consisting of prostate specific 
antigen, MAGE 1-4, MAGE 6, MAGE 12, a mucin, tyrosinase, 
10 MART, Pmel 17, N-acetylglucoaminyltransf erase V intron V 
sequence. Prostate Ca psm, PRAME, p-catenin, MUM-l-B, 
GAGE-1, BAGE 2-10, C-ERB2, Epstein-Bar r Virus nuclear 
antigen 1-6, gp75, human papilloma virus E6 and E7, p53, 
lung resistance protein, Bcl-2, and Ki-67. 

15 52. The method of claim 16, wherein the mammal is 

suspected of having or being susceptible to cancer. 

53. An isolated peptide comprising a fragment of 
CEA comprising KEVLLLVHNLPQH (SEQ ID N0:1) and a further 
1-15 residues of CEA sequence at the C-terminus of SEQ ID 

20 N0:1, at the N-terminus of SEQ ID N0:1, or at both the N- 
terminus and the C-terminus of SEQ ID N0:1. 

54. An isolated peptide comprising a fragment of 
CEA comprising YLWWVNGQSLPVSPR (SEQ ID NO: 7) and a further 
1-15 residues of CEA sequence at the C-terminus of SEQ ID 

25 NO: 7, at the N-terminus of SEQ ID NO: 7, or at both the N- 
terminus and the C-terminus of SEQ ID NO: 7. 
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55, An isolated peptide comprising a fragment of 
CEA comprising NPPAQYSWLIDGNIQQH (SEQ ID NO: 14) and a 
further 1-15 residues of CEA sequence at the C- 

5 terminus of SEQ ID NO: 14, at the N-terminus of SEQ ID 

NO: 14, or at both the N-terminus and the C-terminus 
of SEQ ID NO: 14. 

56. A method of activating T cell reactivity in a 
mammal, the method comprising: 

10 (a) providing {i)the peptide of claim 53, or (ii) a 

DNA encoding a polypeptide selected from the group 
consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence; and 

15 (b) administering the peptide or DNA to the mammal . 

57, A method of activating T cell reactivity in a 
mammal, the method comprising: 

(a) providing (i)the peptide of claim 54, or (ii) a 
DNA encoding a polypeptide selected from the group 

20 consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence; and 

(b) administering the peptide or DNA to the mammal. 

58. A method of activating T cell reactivity in a 
25 mammal, the method comprising: 

(a) providing (i)the peptide of claim 55, or (ii) a 
DNA encoding a polypeptide selected from the group 
consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
30 (2) linked to a trafficking sequence; and 
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(b) administering the peptide or DNA to the mammal - 

59. An isolated fragment of CEA shorter than full- 
length CEA, wherein the fragment comprises one or more 
amino acid sequences selected from the group consisting of 

5 SEQ ID N0:1, SEQ ID NO: 7, and SEQ ID NO: 14. 

60. A method of activating T cell responsiveness 
in a mammal, the method comprising: 

(a) providing (i) the fragment of claim 59, or (ii) a 
DNA encoding a polypeptide selected from the group 

10 consisting of (1) the fragment, (2) the fragment plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence, wherein the DNA does 
not encode full-length CEA; and 

(b) administering the fragment or DNA to the mammal. 

15 61. A method of activating T cell responsiveness 

in a mammal, the method comprising: 

(a) administering to the mammal (i) CEA or (ii) a DNA 
encoding CEA; and 

(b) testing CD4+ T cells of the animal for 

20 responsiveness to a naturally processed fragment of CEA. 



62. An isolated peptide comprising: 

(a) at least one CEA segment selected from the group 
consisting of SEQ ID N0:1, SEQ ID NO: 7, and SEQ ID NO: 14; 

25 and 

(b) one or more amino acids at the C-terminus of the 
CEA segment, at the N-terminus of the CEA segment, or at 
both the C- terminus and the N-terminus of the CEA segment, 

wherein the peptide has an amino acid sequence that 
30 is not identical to that of full-length CEA. 
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63. A method of activating T cell reactivity in a 
manraial, the method comprising: 

(a) providing (i) the peptide of claim 62, or (ii) a 
DNA encoding a polypeptide selected from the group 

5 consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence, wherein the DNA does 
not encode full-length CEA; and 

(b) administering the peptide or DNA to the mammal. 

10 
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ATGGAGTCTCCCTCGGCCCCTCCCCACAGAfGGTGCATCCCCTGGCAGAGGCTCCTGCT 

CACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCAAGCTCACTATTGAAT 

CCACGCCGTTCAATGTCGCAGAGGGGAAGGAGGTGCTTCTACTTGTCCACAATCTGCCC 

CAGCATCTTTTTGGCTACAGCTGGTAC/yVAGGTGAAAGAGTGGATGGCAACCGTCAAAT 

TATAGGATATGTAATAGGAACTCAACAAGCTACCCCAGGGCCCGCATACAGTGGTCGAG 

AGATAATATACCCCAATGCATCCCTGCTGATCCAGAACATCATCCAGAATGACACAGGA 

TTCTACACCCTACACGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACT<3GCCAGTT 

CCGGGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAAACCCGTGG 

AGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACGCAACCTACCTG 

TGGTGGGTAAACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGCAA 

CAGGACCCTCACTCTATTCAATGTCACAAGAAATGACACAGCAAGCTACAAATGTGAAA 

CCCAGAACCCAGTGAGTGCCAGGCGCAGTGATTCAGTCATCCTGAATGTCCTCTATGGC 

CCGGATGCCCCCACCATTTCCCCTCTAAACACATCTTACAGATCAGGGGAAAATCTGAA 

CCTCTCCTGCCATGCAGCCTCTAACCCACCTGCACAGTACTCTTGGTTTGTCAATGGGA 

CTTTCCAGCAATCCACCCAAGAGCTCTTTATCCCCAACATCACTGTGAATAATAGTGGA 

TCCTATACGTGCCAAGCCCATAACTCAGACACTGGCCTCAATAGGACCACAGTCACGAC 

GATCACAGTCTATGCAGAGCCACCCAAACCCTTCATCACCAGCAACAACTCCAACCCCG 

TGGAGGATGAGGATGCTGTAGCCTTAACCTGTGAACCTGAGATTCAGAACACAACCTAC 

CTGTGGTGGGTAAATAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGA 

CAACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGATGTAGGACGCTATGAGTGTG 

GAATCCAGAACGAATTAAGTGTTGACCACAGCGACCCAGTCATCCTGAATGTCCTCTAT 

GGCCCAGACGACCCCACCATTTCCCCCTCATACACCTATTACCGTCCAGGGGTGAACCT 

CAGCCTCTCCTGCCATGCAGCCTCTAACCCACCTGCACAGTATTCTTGGCTGATTGATG 

GGAACATCCAGCAACACACACAAGAGCTCTTTATCTCCAACATCACTGAGAAGAACAGC 

GGACTCTATACCTGCCAGGCCAATAACTCAGCCAGTGGCCACAGCAGGACTACAGTCAA 

GACAATCACAGTCTCTGCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAAAC 

CCGTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGGCTCAGAACACAACC 

TACCTGTGGTGGGTAAATGGTCAGAGCCTCCCAGTCAGTCCCAGGCTGCAGCTGTCCAA 

TGGCAACAGGACCCTCACTCTATTCAATGTCACAAGAAATGACGCAAGAGCCTATGTAT 

GTGGAATCCAGAACTCAGTGAGTGCAAACCGCAGTGACCCAGTCACCCTGGATGTCCTC 

TATGGGCCGGACACCCCCATCATTTCCCCCCCAGACTCGTCTTACCTTTCGGGAGCGAA 

CCTCAACCTCTCCTGCCACTCGGCCTCTAACCCATCCCCGCAGTATTCTTGGCGTATCA 

ATGGGATACCGCAGCAACACACACAAGTTCTCTTTATCGCCAAAATCACGCCAAATAAT 

AACGGGACCTATGCCTGTTTTGTCTCTAACTTGGCTACTGGCCGCTVATAATTCCATAGT 

CAAGAGCATCACAGTCTCTGCATCTGGAACTTCTCCTGGTCTCTCAGCTGGGGCCACTG 

TCGGCATCATGATTGGAGTGCTGGTTGGGGTTGCTCTGATATAG 
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Fig. IB 

MESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTffiSTPFNVAEGKEVLLLV 

HNLPQHLFGYSWYKGERVDGNRQnOYVIGTQQATPGPAYSGREnYPNASLLIQN 

nqi^TGFmJWIKSDLVNEEATGQFRVYPEIJKPSISSNNSKPVEDKDAVAF^ 

EPETQDATYLWWVIWQSLPVSPRIXJI^NGNRTLTLFNVTRNDTASYKCETQNPV 

SARRSDSVILNVLYGPDAPTISPLNTSYRSGENLNLSCHAASNPPAQYSWFVNGT 

FQQSTQELFIPNITVNNSGSYTCQAHNSDTGLNRTTVTTITVYAEPPKPFITSNNSN 

PVEDEDAVALTCEPEIQmrYLWWVNNQSLPVSPRLQI^NDimTLTLLSVTRND 

VGPYECGIQNELSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNP 

PAQYSWLroGMQQHTQELFISNITEKNSGLYTCQAlWSASGHSRTTVKTITVSAE 

IJKPSISSNNSKPVEDKDAVAFTCEPEAQhrrTYLWWVNGQSLPVSPRLQLSNGN 

RTLTLFNVTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPnSPPDSSYLSGA 

NLNLSCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGR 

NNSrVKSITVSASGTSPGLSAGATVGIMlGVLVGVALI 
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. PEPTIDE EPITOPES RECOGNIZED BY ANTIGEN SPECIFIC CD4'*' T 

LYMPHOCYTES 

Field of the Invention 
5 The invention relates to the identification of 

naturally processed and presented HLA class II restricted 
peptides and their use as therapeutics and prophylactics. 

Background of the Invention 
10 After internalization and proteolytic processing of 

intact protein antigens by antigen presenting cells 
(APCs), class II Major Histocompatibility Complex (MHC) 
molecules on the APCs bind short antigenic peptides 
(epitopes) derived from the antigens^ presenting the bound 
15 peptides to CD4"** T lymphocytes [Germain, R.N. (1994), Cell 
76:287-299]. Class II MHC genes and the molecules they 
encode are highly variable among individuals, and 
differences among the 

class II MHC molecules determine which peptides are 
20 selected for presentation as T cell epitopes. 

Identification of the naturally processed peptide 
fragments of a polypeptide of interest that are presented 
by class II MHC molecules of a subject can be useful for 
developing peptides that regulate immune response to the 
25 polypeptide in that subject. 

Carcinoembryonic antigen (CEA) was first described by 
Gold and Freedman (1965), J. Exp. Med. 121:439-462. CEA 
is a highly glycosylated, 180, 000-dalton protein that is 
expressed on most gastrointestinal carcinomas, including a 
30 large number of primary and metastatic colorectal tumors. 
It is also found on some normal, endodermally derived 
tissues, though in much lower concentrations. CEA cell 
surface expression can be detected using the monoclonal 
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antibody (mAb) Col-1, and CEA can be identified in total 
cellular protein by western blot analysis using the same 
mAb. An MHC class I {HLA-A2) -restricted cytotoxic 
T lymphocyte (CTL) response against a processed CEA 
5 epitope has been described in patients with metastatic 
disease [Tsang et al. (1995), J. Natl. Cancer. Inst. 
87:982-990]. This MHC class I-restricted epitope was 
identified by taking peripheral blood lymphocytes from 
patients after immunization with a recombinant vaccinia 

10 virus encoding CEA and screening the T cell response 

against synthetic 9- to 11-mer CEA peptides. CTLs from 
three patients were shown to have specificity for a class 
I-restricted CEIA epitope in an assay using autologous 
Epstein Barr Virus (EBV) -transformed B cells pulsed with a 

15 9 amino acid synthetic peptide derived from CEA. 

Generally, HLA class Il-restricted epitopes are 
different from class I-restricted epitopes, and produce 
different immunological responses. Epitopes presented by 
■ HLA class II molecules are generally, although not 

20 exclusively, recognized by CD4"*' T cells. The HLA class II 
restricted CD4+ T cell response can act as a helper 
response, providing help to HLA class I-restricted CTL, 
Class Il-restricted responses may also result in the 
recruitment of cells such as tumoricidal macrophages and 

25 eosinophils that work in unison to eliminate neoplastic 
cells. Furthermore, recent evidence suggests that T 
helper cells can elicit an anti-angiogenic response in the 
microenvironment of a tumor mass, thereby cutting off the 
blood supply to the rapidly dividing cells. 

30 The potential role of CD4"*' T cells in antitumor 

immunity and cancer immunotherapy has been the subject of 
general review [Topalian (1994), Current Opinion in 
Immunology 6:741-745; Pardoll and Topalian (1998), Current 
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Opinion in Irtmiunology 10:588-594; Toe et al. (1999), J. 
Exp. Med. 189:753-756]. Among other functions, CD4"^ T 
cells provide help and inununological memory, can play an 
important role in initiating and maintaining specific 
5 anti-tumor immunity, and may be instrumental in the 
phenomenon of tumor-specific T cell tolerance. 

SUMMARY OF THE INVENTION 
The invention features methods for identifying 

10 peptide epitopes that activate CD4^ T lymphocyte responses 
involved in the initiation, promotion, or exacerbation of 
certain diseases. 

The invention is based on the discovery that one can 
identify HLA class Il-restricted epitopes naturally 

15 produced by APC trans feet ed with DNA encoding a protein 
from which the epitopes are derived. These peptide 
epitopes can be incorporated into a drug delivery system 
and used to elicit an epitope-specif ic CD4* T cell response 
in an appropriate mammalian subject. Where the epitope is 

20 derived from a tumor antigen, such a response promotes the 
migration of CDS* CTL to the vicinity of tumor cells 
expressing the tumor antigen and specific targeting of the 
tumor cells by the CTL. This 004"*" T lymphocyte response 
could also promote migration of tumoricidal macrophages 

25 and eosinophils to the vicinity of the tumor cells to help 
kill the tumor cells. Finally, the CD4* T cell response 
can also produce an anti-angiogenic effect at the 
microenvironment of the tumor, cutting off the nutrient 
supply needed to continue cell division and tumor growth. 

30 Altered peptide ligands (APL) , which are class II 

MHC-binding variant peptides in which 1 to 6 amino acid 
residues are different from the corresponding residues of 
the wild-type class II MHC-binding peptide, but which 
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Still bind to the same class II MHC molecules as the wild- 
type peptides, are also encompassed by the invention, as 
are methods of therapy and prophylaxis involving the use 
of APL- APL have the ability to elicit different patterns 
5 of cytokine production in CD4'*' T cells than do their parent 
wild-type peptides. Thus, for example, while a wild-type 
peptide presented by a class II MHC molecule may induce 
production of Thl cytokines, an APL derived from it and 
presented by the same class II MHC molecule may elicit Th2 

10 or other immunoregulatory cytokines. Alternatively, the 
wild-type peptide may stimulate the production of Th2 
cytokines while a corresponding APL elicits production of 
Thl cytokines - 

More specifically, the invention features a method of 

15 identifying a class II MHC-binding fragment of a 
polypeptide. The method involves: (a) providing a 
mammalian antigen-presenting cell (APC) cell line 
transfected with a DNA encoding a polypeptide; (b) 
isolating from the APC a class II MHC molecule bound to a 

20 peptide, the peptide being a class II MHC-binding fragment 
of the polypeptide; (c) eluting the peptide from the class 
II MHC molecule; and (d)" identifying the peptide as a 
fragment of the polypeptide. The polypeptide can have the 
sequence of a tumor antigen (e.g., CEA) . The APC can be 

25 any class II MHC-expressing mammalian cell, e.g., a 
dendritic cell, a macrophage, a monocyte or a B 
lymphocyte, and the mammal from which it is derived can be 
a human. The class II MHC molecule can be a HLA-DR 
molecule, or a HLA-DQ molecule, or a HLA-DP molecule. 

30 Such a DR molecule can have a p-chain encoded by a 

DRB1*0401, DEIB1*G101, DRB1*0301, DRB1*0701, DRB1*1101, 
DRB1*1301, DRB1*1302, DRB1*1501, or DRB5*0101 gene. 
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The invention provides an isolated peptide fewer than 
35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, 
or 14) amino acid residues in length and including the 
sequence KEVLLLVHNLPQH (SEQ ID N0:1) . The peptide thus 
5 can include or be the amino acid sequence KEVLLLVHNLPQHL 
(SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID N0:3); 
KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL (SEQ ID 
NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6); or 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13). Also embraced by the 

10 invention is a peptide fewer than 35 (e.g., fewer than 33, 
30, 26, 22, 19, 18, 17, 16, 15, or 14) amino acid residues 
in length that includes or is the sequence YLWWVNGQSLPVSPR 
(SEQ ID NO: 7), optionally with additional CEA sequence on 
one or both ends. The invention also includes a peptide 

15 fewer than 35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 

17, 16, 15, or 14) amino acid residues in length and 
including the sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 
The peptide thus can include or be the amino acid sequence 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14) or NPPAQYSWLIDGNIQQHT 

20 (SEQ ID NO: 15) . 

Another embodiment of the invention is an altered 
peptide ligand (APL) the amino acid sequence of which is 
identical, except for 1-6 amino acid substitutions, to a 
fragment of carcinoembryonic antigen (CEA) , the fragment 

25 being fewer than 35 (e.g., fewer than 33, 30, 26, 22, 19, 

18, 17, 16, 15, or 14) amino acids residues in length and 
including or being the sequence KEVLLLVHNLPQH (SEQ ID 
N0:1), YLWWVNGQSLPVSPR (SEQ ID N0:7), or 
NPPAQYSWLIDGNIQQH (SEQ ID N0:14), optionally with 

30 additional CEA sequence on one or both ends. No more than 
30% of the amino acid residues of the fragment are 
substituted with different amino acid residues in the APL, 
and the APL can bind to a class II MHC molecule. The 
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sequence of the fragment can be, for example, 
KEVLLLVHNLPQH (SEQ ID N0:1), YLWWVNGQSLPVSPR (SEQ ID 
N0:7), or NPPAQYSWLIDGNIQQH (SEQ ID NO: 14), optionally 
with additional CEA sequence on one or both ends. In this 
5 method, the mammal can be one suspected of having or being 
susceptible to cancer. 

The invention also features a process for making an 
APL. The method involves: (a) carrying out the above 
method of identifying a class II MHC-binding fragment of a 

10 polypeptide, and (b) synthesizing an APL consisting of a 
sequence which is identical to that of the peptide, except 
having amino acid sxibstitutions at 1, 2, 3, 4, 5, or 6 
positions in the peptide. The polypeptide can be a tumor 
antigen such as CEA. 

15 Another aspect of the invention is a method of 

activating T cell reactivity in a mammal. The method 
involves: (a) providing (i) a peptide, the sequence of 
which consists of the sequence of a naturally processed 
fragment of CEA, the peptide being capable of binding to a 

20 class II MHC molecule of the mammal and of eliciting a CD4'^ 
T cell response, or (ii) a DNA encoding a polypeptide that 
is (1) the peptide, (2) the peptide plus an amino terminal 
methionine residue, and (3) either (1) or (2) linked to a 
trafficking sequence; and (b) administering the peptide or 

25 DNA to the mammal. The peptide can include or be 

KEVLLLVHNLPQH (SEQ ID N0:1), KEVLLLVHNLPQHL (SEQ ID NO: 2), 
KEVLLLVHNLPQHLF (SEQ ID NO: 3), KEVLLLVHNLPQHL FG (SEQ ID 
NO: 4), GKEVLLLVHNLPQHL (SEQ ID NO: 5), EGKEVLLLVHNLPQHLFG 
(SEQ ID N0:6), YLWWVNGQSLPVSPR (SEQ ID N0:7), 

30 EGKEVLLLVHNLPQHL (SEQ ID NO: 13), NPPAQYSWLIDGNIQQH (SEQ ID 
N0:14), or NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15), optionally 
with additional CEA sequence on one or both ends. 
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The invention also provides a method of altering a T 
cell response in a laammal. The method involves: (a) 
providing (i) an APL having a sequence identical, except 
for amino acid substitutions at 1-6 positions, to the 
5 sequence of a naturally-processed fragment of CEA, the APL 
being able to bind to a class II MHC molecule of the 
mammal, or (ii) a DNA encoding a polypeptide that can be 
(1) the APL, (2) the APL plus an amino terminal methionine 
residue, and (3) either (1) or (2) linked to a trafficking 
10 sequence; and (b) administering the APL or DNA to the 
mammal. 

The above described method of identifying a class II 
MHC-binding fragment of a polypeptide can include the 
additional steps of: (e) providing lymphocytes from a 

15 mammal having a condition suspected of being associated 
with presentation of the peptide by the class II MHC 
molecule, the APCs of the mammal bearing the class II MHC 
molecule; (f) providing a population of APCs that bear the 
class II MHC molecule with the peptide bound thereto; (g) 

20 contacting the population of APCs of (f) with the CD4* 
lymphocytes of (e) ; and (h) determining whether the CD4* 
lymphocytes recognize the class II MHC-bound peptide, as 
an indication that presentation of the peptide to CD4'^ T 
lymphocytes is associated with the condition. The 

25 presentation can be associated with a pathological 

response of CD4**' T lymphocytes or a protective response of 
CD4'*" T lymphocytes. 

Another aspect of the invention is a method of 
diagnosis that involves: (a) providing a CD4^ lymphocyte 

30 from an individual suspected of having or being 

susceptible to cancer; (b) providing an APC which bears on 
its surface a class II MHC molecule of an allele identical 
to one expressed by the individual, the class II MHC 
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molecule being bound to a CEA peptide; (c) contacting the 
APC with the CD4"*" lymphocyte; and (d) determining whether 
the CD4'*' lymphocyte recognizes the class II MHC-bound 
peptide, as an indication that the individual has or is 
5 susceptible to cancer. The peptide can include or be the 
amino acid sequence KEVLLLVHNLPQH (SEQ ID N0:1); 
KEVLLLVHNLPQHL (SEQ ID NO: 2;) KEVLLLVHNLPQHLF (SEQ ID 
NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL 
(SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID N0:6); 

10 YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGECEVLLLVHNLPQHL (SEQ ID 
N0:13); NPPAQYSWLIDGNIQQH (SEQ IDN0:14); or 
NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15), optionally with 
additional CEA sequence on one or both ends. 

The invention also features a method of treating a 

15 subject suspected of having or being susceptible to 

cancer. The method involves administering a peptide to 
the subject. The peptide can be fewer than 35 (e.g., 
fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, or 14) 
amino acid residues in length and include a sequence 

20 KEVLLLVHNLPQH (SEQ ID N0:1), optionally with additional 
CEA sequence on one or both ends. Alternatively, the 
peptide can be fewer than 35 (e.g., fewer than 33, 30, 26, 
22, 19, 18, 17, 16, 15, or 14) amino acid residues in 
length and include or be the sequence YLWWVNGQSLPVSPR (SEQ 

25 ID NO: 7), optionally with additional CEA sequence on one 
or both ends. Furthermore, the peptide can be fewer than 
35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, 
or 14) amino acid residues in length and include or be the 
sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14), optionally with 

30 additional CEA sequence on one or both ends. 

The invention also provides a method of identifying a 
reagent for diagnosing cancer. The method involves: 
(a) providing a test reagent that can be a Fab fragment, a 



- 8 - 



wo 2002/022803 



PCT/US2001/028467 



monoclonal antibody (mAb) , or a single chain Fv (scFv) 
fragment; (b) providing a complex that contains a class II 
MHC molecule bound to a peptide that includes or is 
KEVLLLVHNLPQH{SEQ ID N0:1); KEVLLLVHNLPQHL (SEQ ID N0:2); 
5 KEVLLLVHNLPQHLF (SEQ ID NO: 3); KEVLLLVHNLPQHL EXS (SEQ ID 
NO: 4); GKEVLLLVHNLPQHL (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG 
(SEQ ID NO: 6); YLWWVNGQSLPVSPR (SEQ ID N0:7); 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13); NPPAQYSWLIDGNIQQH (SEQ ID 
NO: 14); or NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15); and 

10 (c) testing whether the test reagent binds to the complex, 
The class II MHC molecule can be a DR molecule with a p- 
chain encoded by a DRB1*0401 gene or by a DRB1*0101 gene. 
As used herein, a ^'scFv" fragment is a recombinant 
fragment of an antibody molecule that contains, in a 

15 single polypeptide chain, the antigen-binding regions of 
an immunoglobulin (Ig) heavy and an Ig light chain, scFv 
fragments generally either contain (a) no Ig heavy or Ig 
light chain constant regions or (b) less than the whole 
constant region of an Ig heavy and/or Ig light chain. 

20 Also embraced by the invention is a method of 

diagnosis. The method involves: (a) providing a test cell 
from a mammalian subject; (b) providing a reagent that 
binds to a CEA peptide fragment bound to a class II MHC 
molecule; (c) contacting the test cell with the reagent; 

25 and (d) detecting binding of the reagent to the test cell 
as an indication that the test cell is a cancer cell. 

The invention also features a method of cancer 
therapy that involves: (a) providing a composition that 
includes a reagent that can be a Fab fragment, a mAb, or a 

30 scFv fragment, the reagent being able to recognize a 

naturally processed CEA peptide bound to a MHC class II 
molecule and being linked to an agent that can be a 
chemotherapeutic compound, a radioactive isotope or a 
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toxin; and (b) administering the composition to a subject 
suspected of having or being susceptible to a cancer 
characterized by expression of CEA. 

Another aspect of the invention is a method of 
5 identifying a class II MHC-binding fragment of a tumor 
antigen. The method involves: (a) providing a mammalian 
APC that contains a class II MHC molecule and the tumor 
antigen; (b) isolating from the APC the class II MHC 
molecule bound to a peptide, the peptide being a class II 

10 MHC binding fragment of the tumor antigen; (c) eluting the 
peptide from the class II MHC molecule; and (d) 
identifying the amino acid sequence of the peptide. The 
method can involve the additional steps of: (e) providing 
CD4* lymphocytes from a mammal having a cancer suspected of 

15 being associated with presentation of the peptide by the 
class II MHC molecule, the APCs of the mammal bearing the 
class II MHC molecule; (f ) providing a population of APCs 
that bear the class II MHC molecule with the peptide bound 
thereto; (g) contacting the population of APCs of (f) with 

20 the CD4* lymphocytes of (e) ; and (h) determining whether 
the CD4"*" lymphocytes recognize the class II MHC bound 
peptide, as an indication that presentation of the peptide 
to CD4"^ lymphocytes is associated with the cancer. 

The invention also provides an isolated DNA that 

25 contains a nucleotide sequence that encodes a peptide 
fewer than 35 (e.g., fewer than 33, 30, 26, 22, 19, 18, 
17, 16, 15, or 14) amino acid residues in length, and that 
contains the sequence KEVLLLVHNLPQH (SEQ ID N0:1). The 
peptide encoded by the DNA thus can include or be, for 

30 example, the amino acid sequence KEVLLLVHNLPQHL (SEQ ID 
NO: 2); KEVLLLVHNLPQHLF (SEQ ID NO: 3); KEVLLLVHNLPQHLFG 
(SEQ ID NO: 4); GKEVLLLVHNLPQHL (SEQ ID N0:5); 
EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6); or EGKEVLLLVHNLPQHL (SEQ 
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ID NO: 13)/ optionally with additional CEA (or non-CEA) 
sequence on one or both ends. Also embraced by the 
invention is DNA encoding a peptide fewer than 35 (e.g., 
fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, or 14) 

5 amino acid residues in length that includes or is the 
sequence YLWWVNGQSLPVSPR (SEQ ID NO: 7), optionally with 
additional CEA sequence on one or both ends. The 
invention also features an isolated DNA that contains a 
nucleotide sequence that encodes a peptide fewer than 35 

10 (e.g., fewer than 33, 30, 26, 22, 19, 18, 17, 16, 15, or 
14) amino acid residues in length, and that contains the 
sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). The peptide 
encoded by the DNA thus can include or be, for example, 
the amino acid sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14), 

15 or NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15), optionally with 
additional CEA sequence on one or both ends. Also 
featured is a vector containing the above DNA of the 
invention. In the vector the nucleotide sequence can be 
operatively linked to a transcriptional regulatory 

20 element. Also included in the invention is a cell (e.g., 
a mammalian, an insect, a bacterial, a yeast, or a fungal 
cell) containing any of the vectors of the invention. 

Another aspect of the invention is an isolated 
peptide containing a fragment of CEA that includes: 

25 (a) KEVLLLVHNLPQH (SEQ ID N0:1) and a further 1-15 

residues of CEA sequence at the C-terminus of SEQ ID N0:1, 
at the N-terminus of SEQ ID N0:1, or at both the N- 
terminus and the C-terminus of SEQ ID N0:1; 
(b) YLWWVNGQSLPVSPR (SEQ ID N0:7) and a further 1-15 

30 residues of CEA sequence at the C-terminus of SEQ ID NO: 7, 
at the N-terminus of SEQ ID NO: 7, or at both the N- 
terminus and the C-terminus of SEQ ID NO: 7; or (c) 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14) and a further 1-15 
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residues of CEA sequence at the C-terminus of SEQ ID 
NO: 14, at the N-terminus of SEQ ID NO: 14, or at both the 
N-terminus and the C-terminus of SEQ ID NO: 14- This 
peptide can be used in a method of activating T cell 
5 reactivity in a mammal. The method involves: (a) 

providing (i) any of the peptide or (ii) a DNA encoding a 
polypeptide which can be (1) the peptide^ (2) the peptide 
plus an amino terminal methionine residue, or (3) either 
(1) or (2) linked to a trafficking sequence; and (b) 
. 10 administering the peptide or DNA to the mammal. 

Another embodiment of the invention is an isolated 
fragment of CEA shorter than full-length CEA; the fragment 
includes one or more amino acid sequences selected from 
the group consisting of SEQ ID N0:1, SEQ ID NO: 7, and SEQ 

15 ID NO: 14. This fragment of CEA can be used in a method of 
activating T cell responsiveness in a mammal. The method 
involves: (a) providing (i) the CEA fragment, or (ii) a 
DNA encoding a polypeptide that can be (1) the CEA 
fragment, (2) the CEA fragment plus an amino terminal 

20 methionine residue, and (3) either (1) or (2) linked to a 
trafficking sequence, the DNA not encoding full-length 
CEA; and 

(b) administering the fragment or DNA to the mammal. 

Another method of activating T cell responsiveness in 

25 a mammal involves: (a) administering to the mammal 

(i) CEA or (ii) a DNA encoding CEA; and (b) testing CD4+ T 
cells of the animal for responsiveness to a naturally 
processed fragment of CEA (e.g., fragments with SEQ ID 
N0S:l-7 and 13-15) . 

30 The invention also provides an isolated peptide that 

includes: (a) at least one CEA segment that can be' a 
segment with SEQ ID N0:1, SEQ ID NO: 7, and SEQ ID NO: 14; 
and (b) one or more amino acids at the C-terminus of the 
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CEA segment, at the N-terminus of the CEA segment, or at 
both the C-terminus and the N-terminus of the CEA segment, 
the peptide having an amino acid sequence that is not 
identical to that of full-length CEA. This peptide can be 
5 used in a method of activating T cell reactivity in a 
mammal. The method involves: (a) providing (i) the 
peptide, or (ii) a DNA encoding a polypeptide that can be 
(1) the peptide, (2) the peptide plus an amino terminal 
methionine residue, and (3) either (1) or (2) linked to a 
10 trafficking sequence, the DNA not encoding full-length 
CEA; and (b) administering the peptide or DNA to the 
mammal . 

An "isolated" peptide of the invention is a peptide 
that either has no naturally-occurring counterpart (e.g., 

15 an APL) , or has been separated or purified from components 
which naturally accompany it,- e.g., in tissues such as 
pancreas, liver, spleen, ovary, testis, muscle, joint 
tissue, neural tissue, gastrointestinal tissue, or body 
fluids such as blood, serum, or urine. Such a naturally- 

20 occurring peptide is considered "isolated" when it is at 
least 70%, by dry weight, free from the proteins and 
peptides with which it is naturally associated. 
Preferably, a preparation of a peptide of the invention is 
at least 80%, more preferably at least 90%, and most 

25 preferably at least 99%, by dry weight, the peptide of the 
invention. Since a peptide that is chemically synthesized 
is, by its nature, separated from the components that 
naturally accompany it in the tissue where it occurs 
naturally, the synthetic peptide is, by definition, 

30 "isolated. " 

7^ isolated peptide of the invention can be obtained, 
for example, by extraction from a natural source (e.g., 
from human tissues or bodily fluids) ; by expression of a 



- 13 - 



wo 2002/022803 



PCTAJS2001/028467 



recombinant nucleic acid encoding the peptide; or by 
chemical synthesis, A peptide that is produced in a 
cellular system different from the source from which it 
naturally originates is "isolated," because it will be 
5 separated from components which naturally accompany it. 
The extent of isolation or purity can be measured by any 
appropriate method, e.g., column chromatography, 
polyacrylamide gel electrophoresis, mass analysis, or HPLC 
analysis. 

10 An "isolated DNA" means DNA free of the genes that 

flank the gene of interest (e.g., the gene encoding CEA) 
in the genome of the organism in which the gene of 
interest naturally occurs. The term therefore includes a 
recombinant DNA incorporated into a vector, into an 

15 autonomously replicating plasmid or virus, or into the 
genomic DNA of a prokaryote or eukaryote. It also 
includes a separate molecule such as: a cDNA where the 
corresponding genomic DNA has introns and therefore a 
different sequence; a genomic fragment; a fragment 

20 produced by polymerase chain reaction (PGR) ; a restriction 
fragment; a DNA encoding a non-naturally occurring 
protein, fusion protein, or fragment of a given protein; 
or a nucleic acid which is a degenerate variant of a 
naturally occurring nucleic acid. In addition, it 

25 includes a recombinant nucleotide sequence that is part of 
a hybrid gene, i.e., a gene encoding a fusion protein. 
Also included is a recombinant DNA that includes a DNA 
sequence that encodes any of the peptides with SEQ ID 
N0S:l-7 and 13-15, 

30 As used herein, "protection from a mammalian disease" 

means prevention of onset of a mammalian disease or 
lessening the severity of a disease existing in a mammal, 
"Prevention" can include a delay of onset, as well as a 
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partial or complete block in progress of the disease or 
disease symptoms. 

As used herein, "a naturally-processed peptide 
fragment" is a peptide fragment produced by proteolytic 
5 degradation of a protein in an antigen presenting cell of 
a mammal. As used herein, a "tumor antigen" is a molecule 
(e.g., a protein molecule) that is expressed by a tumor 
cell. Such a molecule can differ (e.g., by one or more 
amino acid residues where the molecule is a protein) from, 
10 or it can be identical to, its counterpart expressed in 
normal cells. It is preferably not expressed by normal 
cells. Alternatively, it is expressed at a higher level 
(e.g., a two-fold, three-fold, five-fold, ten- fold, 20- 
fold, 40-fold, 100-fold, 500-fold, 1,000-fold, 5,000-fold, 
15 or 15,000-fold higher level) in a tumor cell than in the 
tumor cell's normal counterpart. Examples of tumor 
antigens include, without limitation, CEA, prostate 
specific antigen (PSA), MAGE (melanoma antigen) 1-4, 6 and 
12, MUC (mucin) (e.g., MUC-1, MUC-2, etc.), tyrosinase, 
20 MART (melanoma antigen), Pmel 17(gpl00), GnT-V intron V 
sequence (N-acetylglucoaminyltransferase V intron V 
sequence) , Prostate Ca psm, PRAME (melanoma antigen) , p- 
catenin, MUM-l-B (melanoma ubiquitous mutated gene 
product), GAGE (melanoma antigen) 1, BAGE (melanoma 
25 antigen) 2-10, C-ERB2 {Her2/neu) , EBNA (Epstein-Barr Virus 
nuclear antigen) 1-6, gp75, human papilloma virus (HPV) E6 
and E7, p53, lung resistance protein (LRP) , Bcl-2, and Ki- 
67. Recognition of such a peptide by CD4* T cells of a 
mammal (e.g., a hiaman patient) is indicative of the 
30 existence, or future onset, of cancer in the mammal. 
Unless otherwise defined, all technical and 
scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art to 
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which this invention'pertains. In case of conflict, the 
present document, including definitions, will control. 
Preferred methods and materials are described below, 
although methods and materials similar or equivalent to 
those described herein can be used in the practice or 
testing of the present invention. Unless otherwise 
indicated, these materials and methods are illustrative 
only and are not intended to be limiting. All 
publications, patent applications, patents and other 
references mentioned herein are incorporated by reference. 

Other features and advantages of the invention, e.g., 
methods of identifying peptides that activate CD4* T 
lymphocyte responses, will be apparent from the following 
description, from the drawings and from the claims. 

Brief Descri ption of the Drawing s 
Fig. lA is a depiction of the nucleotide sequence- of 
CEA-encoding cDNA (SEQ ID NO: 12) that was inserted into 
the expression vector pcDNA3. 1-Zeo. 

Fig. IB is a depiction of the amino acid sequence of 
CEA (SEQ ID NO: 16) encoded by the cDNA that was inserted 
into the expression vector pcDNA3. 1-Zeo. 

Fig. 2 is a histogram showing the fluorescence- 
activated flow cytometric profiles of JY cells that were 
stained with CEA specific murine mAb and fluorescein 
isothiocyanate (FITC) -conjugated goat antibody specific 
for murine immunoglobulin. 

Fig. 3 is a diagram showing two appropriately aligned 
matrix assisted laser desorption time-of -flight (MALDI- 
TOF) mas spectometry spectra derived from two mixtures of 
peptides in which separate aliquots of peptide mixtures 
prepared from A2. 140-pcDNA3 and A2 . 140-CEA-38 cells were 
analyzed by IMF procedures. 
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Fig. 4 is a MS/MS Spectrum of naturally processed 
KEVLLLVHNLPQH (SEQ ID NO: 1) . The two background 
responses were derived from a peptide that co-eluted 
chromatographically with KEVLLLVHNLPQH (SEQ ID NO: 1) and 
5 was closely related in M/2. These peptides (KEVLLLVHNLPQH 
(SEQ ID NO: 1) and the co-eluting species) could be only 
partially resolved chromatographically, but this did not 
prevent the identification of KEVLLLVHNLPQH (SEQ ID NO: 1) 
from the resultant composite mass spectrum. Spectral 
10 labeling in Figs. 3-9 use the nomenclature described by 
Roepstorff and Fohlman (1988), Biomed. Mass. Spectrom. 
11:601, and Biemann (1990), Meth. Enzymol. 193:866-867. 

Fig. 5 is a MS/MS spectrum of naturally processed 
KEVLLLVHNLPQHL (SEQ ID NO: 2) 
15 Fig. 6 is a MS /MS spectrum of naturally processed 

KEVLLLVHNLPQHLF (SEQ ID NO: 3). 

Fig. 7 is a MS /MS spectrum of naturally processed 
KEVLLLVHNLPQHLFG (SEQ ID NO: 4). 

Fig. 8 is a MS /MS spectrum of naturally processed 
20 GKEVLLLVHNLPQHL (SEQ ID NO: 5). 

Fig. 9 is a MS /MS spectrum of naturally processed 
EGKEVLLLVHNLPQHLFG (SEQ ID N0:6)- 

Fig. 10 is a MS /MS spectrum of naturally processed 
YLWWVNGQSLPVSPR (SEQ ID NO: 7). 
25 Fig. 11 is a MS /MS spectrum of naturally processed 

EGKEVLLLVHNLPQHL (SEQ ID NO: 13). 

Fig. 12 is a MS /MS spectrum of naturally processed 
NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 

Fig. 13 is a MS /MS spectrum of naturally processed 
30 NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 
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DETAILED DESCRIPTION 
The present invention is based on the novel discovery 
that, by trans feet ing an APC with an expression vector 
5 containing a DNA sequence encoding a polypeptide of 

interest (PPI)/ the APC will synthesize the PPI and then 
transport it to one of the antigen-processing organelles 
within the cell, e.g., endosomes, lysozomes and structures 
designated "MHC class II con^artments" (MIIC) that have 

10 lysosomal characteristics and are enriched for class II 
MHC molecules but are substantially devoid of class I MHC 
molecules [Peter et al. (1991), Nature 349:669-676], The 
PPI is degraded by proteolytic enzymes into peptide 
fragments. If any of these peptide fragments has, by 

15 virtue of its length and sequence, the ability to bind to 
one of the class II MHC molecules expressed by the APC, it 
will do so in the antigen processing organelle. The 
resulting peptide-class II MHC molecular complex is then 
transported to the APC's cell membrane, where it becomes 

20 available for interaction with CD4'*' T cells bearing 

antigen-specific receptors that specifically recognize 
that particular peptide-class II MHC complex. By eluting 
peptides from* class II MHC molecules isolated from these 
APC/ a set of naturally processed peptides derived from 

25 the PPI, as well as from other polypeptides of 

intracellular or extracellular origin, is obtained. The 
peptides, which are specific to the particular types 
(isotypes and alleles) of class II MHC molecules expressed 
by the APC, are then chemically separated and their amino 

30 acid sequences determined. By comparison of the peptide 
amino acid sequences to the sequence of the PPI, it is 
possible to identify those which are derived from the PPI. 
Thus, the present invention includes a method of 
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identifying peptide fragments that are naturally processed 
by APC and have intrinsic binding affinity for the 
relevant class II MHC molecule. The method can be 
invaluable for identifying peptides derived from a 
5 polypeptide suspected of being an antigen that activates 
CD4^ T cells involved in either (a) the pathogenesis 
(pathology) of a disease^ especially one in which 
susceptibility or protection is known to be associated 
with expression of a particular type of class II MHC 

10 molecule, or (b) prevention or reduction of the symptoms 
of a disease, especially one in which protection or a 
reduction in severity is associated with expression of a 
particular type of class II MHC molecule. The method is 
designated "Immunological Mass Fingerprinting" ("IMF"). 

15 The described method ensures that the peptides 

identified are those that both (i) are naturally processed 
in vivo by the APC, and (ii) become associated, in the 
APC, with the relevant class II MHC molecules. 

Furthermore, the present method controls for class II 

20 MHC type, an important aspect essential to link any given 
peptide to a particular CD4*^ T cell-mediated disease in a 
given individual but especially important in disorders in 
which class II MHC type determines disease susceptibility 
or resistance. 

25 Any naturally processed peptide with a sequence that 

corresponds to a fragment of the PPI, and which binds to a 
class II MHC molecule associated with the disease of 
interest, could be a peptide that activates CD4'*' T cells 
that either initiate, promote, or exacerbate the disease 

30 or mediate immunity to it. To obtain confirmatory 
evidence of this possibility, test CD4'*" T cells from 
subjects expressing the relevant class II MHC molecules 
can be assayed for responsiveness to a peptide identified 
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in accordance with the invention. Control CD4* T cells can 
be from subjects also expressing the class II MHC molecule 
but without symptoms of the disease. A significant 
response of the test CD4"*" T cells and no or little response 
5 of the control CD4* T cells would indicate that the 

relevant peptide is involved either in the disease process 
(pathology of the disease) or in immunity to the disease. 
The cellular response phase of the method is designated 
"Epitope Verification" ("EV") . 

10 By applying the methods of the invention to the tumor 

antigen CEA^ CBA-derived peptides were identified as 
epitopes that could be involved in the pathogenesis of 
cancer in human patients expressing the DR4 or DRl class 
II MHC allele. Based on their amino acid sequences, these 

15 peptides fall into 3 nested groups. A consensus peptide 
corresponding to the core region of each nested group can 
be synthesized and tested for its ability to activate CD4'*' 
T cells from either DR4, DR13, or DRl-expressing cancer 
patients or DR4, DR13, or DRl-expressing subjects without 

20 disease symptoms. 

The methods of the invention can be applied to 
identifying peptides involved in the pathogenesis of or 
protection from any of a wide range of diseases, 
especially those in which relative susceptibility or 

25 resistance, has been associated with expression of a 

particular class II MHC allele, provided that the amino 
acid sequence (or partial amino acid sequence) of a 
suspect polypeptide antigen is available. Candidate 
diseases include, without limitation, infectious diseases 

30 (e.g., diseases caused by Chlamydia trachomatis ^ 

Helicobacter pylori. Neisseria meningitidis , Mycobacterium 
leprae f M. tuberculosis. Measles virus, hepatitis C virus, 
human immunodeficiency virus, and Plasmodium falciparium) , 
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cancer (e.g. melanoma, ovarian cancer, breast cancer, 
colon cancer and B cell lymphomas) [Topalian (1994), Curr. 
Opinion in Immunol. 6: 741-745; Topalian et al. (1996), J. 
Exp. Med. 183: 1965-1971], and autoimmune diseases (e.g., 
5 insulin dependent diabetes mellitus, rheumatoid arthritis, 
multiple sclerosis, myasthenia gravis, and systemic lupus 
erythmatosus) . 

The invention also includes peptides derived from CEA 
using the above method, as described in Example 1. Also 

10 included in the invention are APL derived from the CEA 
peptides by replacing 1 to 6 (i.e., 1, 2, 3, 4, 5, or 6) 
amino acid residues of a naturally processed peptide that 
activates an immune response. Each residue is replaced 
with a different residue, resulting in a variant peptide 

15 that still binds to the same class II MHC allele, but 

elicits qualitatively different responses in CD4"'" T cells 
than does the parent peptide from which the APL is 
derived. Thus, APL have the potential to be therapeutic 
and/or prophylactic in diseases in which the CD4"*" T cell 

20 response to the relevant parent peptide is pathogenic. 

The invention features methods of therapy and prophylaxis 
involving the use of APL. The invention also features use 
of disease-related peptides in the diagnosis of disease or 
monitoring immune-based therapy. 

25 

1. Methods of Identifying CD4'*' T Cell Activating 

Peptide Epitopes Derived From Polypeptide Antigens 

The methods of the invention have two distinct 
30 phases. The first is termed "Immunological Mass 
Fingerprinting" (IMF) and the second "Epitope 
Verification" (EV) - The purpose of the IMF is to direct a 
candidate polypeptide to any one of the antigen processing 
compartments of an APC where it can be degraded to peptide 
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fragments. Any peptides of the appropriate length (about 
9 to 25 amino acid residues), and having specific binding 
affinity for a particular class II MHC molecule expressed 
by the APC, will bind to that class II MHC molecule in the 
5 antigen processing compartments. At least some of these 
peptide-class II MHC molecular complexes then migrate to 
the cell membrane of the APC, The complexes (both cell- 
membrane associated and intracellular) are isolated from 
the APC and the peptides eluted from the complexes. The 

10 eluted peptides are then separated, their amino acid 

sequences determined, and the sequences compared to that 
of the candidate polypeptide . 

The IMF can generally be applied to the analysis of 
peptides produced by an APC expressing defined class II 

15 MHC molecules. As such, the method can be useful for 

basic research studies, e.g., studies aimed at identifying 
amino acid residues in a polypeptide that determine sites 
of "cutting" by the proteolytic antigen processing enzymes 
of APC. Alternatively, where the polypeptide is suspected 

20 of being an antigen that activates CD4'^ T cells that cause 
or promote a particular disease or mediate protection from 
a disease, the IMF can be used to identify disease-related 
or protective peptide epitopes derived from the 
polypeptide. This information would be useful for basic 

25 research into the etiology of the disease, or as a basis 
for development of diagnostics, therapeutics, or vaccines 
for the disease. 

A peptide whose amino acid sequence matches that of a 
region of the candidate polypeptide is likely to be one 

30 that activates CD4* T cells involved in the pathogenesis of 
or immunity to the relevant disease. Such a peptide can 
be subjected to the EV procedure in which its ability to 
activate CD4'*" T cells from test and control subjects is 
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assayed. Those peptides that activate 004"^ T cells from 
test subjects but not those from control subjects are 
identified as peptides that can initiate, promote, or 
exacerbate the relevant disease or mediate protection from 
5 disease or its pathogenic symptoms. 

Once such a peptide is identified, it can be 
synthesized in large amounts, by chemical or recombinant 
techniques, and used in diagnostic assays similar to the 
EV procedures listed below. Relevant peptides could be 

10 used singly or in combination. Alternatively, expression 
vectors encoding such a peptide or a combination of such 
peptides can be used to transfect or transduce appropriate 
APC (see below), and these can be used in similar 
diagnostic assays. 

15 Furthermore, multimers (e.g., dimers, trimers, 

tetramers, pentamers, or hexamers) of a class II MHC 
molecule complexed with a peptide defined by the method of 
the invention and conjugated to a detectable label (e.g., 
a fluorescent moiety, a radionuclide, or an enzyme that 

20 catalyzes a reaction resulting in a product that absorbs 
or emits light of a defined wavelength) can be used to 
quantify T cells from a subject (e.g., a human patient) 
bearing cell surface receptors that are specific for such 
complexes. Relatively high numbers of such T cells are 

25 likely to be diagnostic of a relevant disease or an 

indication that the T cells are involved in immunity to 
the disease. In addition, continuous monitoring of a 
patient's relative numbers of multimer-binding T cells can 
be useful in tracking the course of a disease or the 

30 efficacy of therapy. Such assays have been developed 

using tetramers of class I MHC molecules complexed with an 
HIV-l-derived or an influenza virus-derived peptide 
[Altman et al. (1996), Science 274:94-96; Ogg et al. 
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(1998), Science 279:2103-2106], and corresponding class II 
MHC multimers would be expected to be similarly useful. 
Such complexes could be produced by chemical cross-linking 
of purified class II MHC molecules assembled in the 
5 presence of a peptide of interest or by modification of 
already established recombinant techniques for the 
production of class II MHC molecules containing a single 
defined peptide [Kazono et al. (1994), Nature 369:151-154; 
Gauthier et al- (1998), Proc. Natl. Acad. Sci. U.S.A. 

10 95:11828-11833]. The class II MHC molecule monomers of 
such multimers can be native molecules composed of full- 
length a and p chains. Alternatively, they can be 
molecules containing only the extracellular domains of the 
a and p chains or the a and p chain domains that form the 

15 "walls" and "floor" of the peptide-binding cleft. 

1.1 IMF 

In IMF, the APC is transfected (transiently or 
stably) with the cDNA encoding the PPI (e.g., CEA) using 

20 standard methods known in the art, and high expressing 
clonal populations are isolated for preparative growth. 
As used herein, the term "transfected" includes infected. 
Thus, the cDNA encoding the PPI can be in an expression 
vector such as a plasmid or an infectious agent such as a 

25 virus. Useful viruses include vaccinia virus, measles 
virus, alpha viruses, and herpes viruses. Additional 
suitable viruses are listed below in the section 
describing APL. Infection of the APC can be performed in 
vitro or in vivo in a mammalian subject prior to 

30 harvesting of the cells from the subject. After producing 
a sufficient number of cells (-1 x 10^ to 1 x 10^° 
depending on the expression of the target tumor antigen) , 
the class II MHC molecules of interest are isolated from 
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the APC by any one of various methods known in the art^ 
e.g., immunoprecipitation. They may be isolated by 
affinity chromatography by the method of Gorga et al. 
(1987), J. Biol. Chem, 262:16087-16094. 
5 Peptides bound non-covalently to the isolated 

class II MHC molecules are then eluted from the latter. A 
variety of methods known in the art can be used, for 
example, the method of Chicz et al. (1992), Nature 
358:764-768; Chicz and Urban, Immunology Today 15:155-160; 

10 Urban et al. Critical Reviews in Immunology 17:387-397 or 
the novel solid phase extraction protocol as in Example 1. 

The eluted peptides are separated by one of a variety 
of possible chromatographic methods, e.g,, reverse phase 
chromatography. All the resulting fractions that contain 

15 peptides are then individually analyzed by MALDI-TOF mass 
spectrometry, using settings that do not fragment the 
peptides. The peptides corresponding to all the "peaks" 
obtained on the MALDI-TOF spectrum can then be subjected 
to individual amino acid sequence analysis. 

20 Alternatively, only novel responses in the ^'test" cell- 
line can be subjected to amino acid sequence analysis. 
Novel responses are peaks that are not observed in a 
control spectrum generated using a sample of peptides 
obtained by an identical procedure but omitting the 

25 transfection of cells with a plasmid that contains a DNA 
sequence that encodes the PPI. The sequences of the 
individual peptides can be obtained by means known to 
those in the art. They can, for example, be obtained by 
MALDI-TOF, using instrument settings resulting in the 

30 fragmentation of the peptides into small fragments that 
are analyzed by the mass spectrometer- The amino acid 
sequences of the peptides are then compared to that of the 
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PPI. Those with a sequence identical to a region of the 
PPI are candidates for EV. 

Alternatively, other approaches for deriving the 
amino acid sequences of individual peptides can be 
5 utilized. These methods invoke a second dimension of 
peptide separation prior to mass spectrometric analysis. 
This is often achieved by coupling a separation technique 
such as reversed phase HPLC to a mass analyzer (such as 
but not limited to a quadrupole ion trap, triple 

10 quadrupole instriament, magnetic sector, Fourier 

transformation cyclotron resonance, quadropole time-of- 
f light, or a hybrid of these analyzers) though an 
electrospray interface. First, novel peptide responses 
that were detected in the ^^test" transfected cell-line but 

15 not the control can be specifically isolated by the mass 
spectrometer and fragmented to yield amino acid sequences - 
Second, fractions can be analyzed in a data dependent mode 
of operation. In this mode, mass peaks are dynamically 
and automatically selected for isolation and fragmentation 

20 to yield amino acid sequences. No prior knowledge of 
novel signals is required for this mode of peptide 
sequencing - 

1.2 EV 

25 The EV procedure involves testing of peptides 

identified by IMF for their ability to (a) bind the 
class II MHC from which they were eluted and (b) activate 
various CD4'*' T cell populations. Peptides with amino acid 
sequences either identical to those identified by IMF or 

30 corresponding to a core sequence derived from a nested 
group of peptides identified by the IMF are synthesized. 
The synthetic peptides are then tested for their ability 
to bind the class II MHC from which they were eluted and 
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activate CD4'*" T cells from (a) test subjects expressing the 
class II MHC molecule of interest and having at least one 
symptom of the disease; and (b) control subjects 
expressing the class II MHC molecule of interest and 
5 having no symptoms of the disease. Additional control 

subjects can be those with symptoms of the disease and not 
expressing the class II MHC molecule of interest. In some 
diseases (e.g., those with an autoimmune component), 
responsiveness in the CD4* T cells of test subjects but not 

10 in CD4"^ T cells of the control subjects described in (b) 
provides confirmatory evidence that the relevant peptide 
is an epitope that activates CD4'*' T cells that can 
initiate, promote, or exacerbate the relevant disease. In 
other diseases (e.g., cancer or infectious diseases 

15 without an autoimmune component), a similar pattern of 

responsiveness and non-respo'nsiveness to that described in 
the previous sentence would indicate that the relevant 
peptide is an epitope that activates CD4^ T cells that can 
mediate immunity to the disease or, at least, a decrease 

20 in the symptoms of the disease. 

Absence of a response in subjects with symptoms of 
the disease but not expressing the class II MHC molecule 
provides further evidence for the stated activities of the 
peptide. On the other hand, a response in such control 

25 CD4'*' T cells would not necessarily exclude such a role for 
the peptide but would suggest that the relevant peptide is 
capable of (i) binding to some MHC class II molecule 
expressed by the relevant subject; and (ii) being 
recognized by CD4* T cells in association with that class 

30 II MHC molecule. If these characteristics are confirmed 
for multiple MHC class II alleles, then the peptide is 
likely to have widespread utility in patients expressing 
different class II alleles - 
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CD4"*' T cell responses can be measured by a variety of 
in vitro methods known in the art. For example, whole 
peripheral blood mononuclear cells (PBMC) can be cultured 
with and without a candidate synthetic peptide and their 
5 proliferative responses measured by, e.g,, incorporation 
of [^H] -thymidine into their DNA. That the proliferating 
T cells are CD4* T cells can be tested by either 
eliminating 004*** T cells from the PBMC prior to assay or by 
adding inhibitory antibodies that bind to the CD4'*" molecule 

10 on the T cells, thereby inhibiting proliferation of the 

latter. In both cases, the proliferative response will be 
inhibited only if CD4**' T cells are the proliferating cells. 

Alternatively, CD4* T cells can be purified from PBMC 
and tested for proliferative responses to the peptides in 

15 the presence of APC expressing the appropriate class II 
MHC molecule. Such APC can be B-lymphocytes, monocytes, 
macrophages, or dendritic cells, or whole PBMC, APC can 
also be immortalized cell lines derived from B- 
lymphocytes, monocytes, macrophages, or dendritic cells. 

20 The APC can endogenously express the class II MHC molecule 
of interest or they can express transfected 
polynucleotides encoding such molecules. Where the 
subjects are humans, the APC can also be T cells since 
human T cells are capable of expressing class II MHC 

25 molecules. In all cases the APC can, prior to the assay, 
be rendered non-proliferative by treatment with, e.g., 
ionizing radiation or mitomycin-C, 

As an alternative to measuring cell proliferation, 
cytokine production by the CD4'*" T cells can be measured by 

30 procedures known to those in art- Cytokines include, 

without limitation, interleukin-2 (IL-2), IFN-y, IL-4, IL- 
5, TNF-a, interleukin-3 (IL-3) , interleukin-6 (IL-6) , 
interleukin-10 (IL-10), interleukin-12 (IL-12), GM-CSF, 
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RANTES, MlP-la, MIP-lp and transforming growth factor (} 
(TGFp) • Assays to measure them include, without 
limitation, ELISA, ELISPOT and bio-assays in which cells 
responsive to the relevant cytokine are tested for 
5 responsiveness (e.g., proliferation) in the presence of a 
test sample. Alternatively, cytokine production by CD4'*' 
lymphocytes can be directly visualized by intracellular 
immunofluorescence staining and flow cytometry. 

Once peptide epitopes associated with a particular 

10 disease have been identified, the EV described above can 
be used as a diagnostic test for the disease. Thus, 
lymphocytes from a subject suspected of having or being 
susceptible to the disease can be tested by any of the 
described methods for a 004"^ T lymphocyte response to one 

15 or more (e.g., 2, 3, 4, 5, 6, 10, 15, or 20) appropriate 
peptides. If a significant CD4'*' T lymphocyte is detected, 
it is likely that the subject has or will develop the 
disease. The disease can be, for example, cancer and the 
peptides can be derived from, for example, CEA. 

20 Appropriate peptides can be, for example, any of those 

listed below {e.g., those with SEQ ID N0S:l-7 and 13-15). 

As an alternative to the above-described EV, peptides 
identified by the IMF can be tested for their ability to 
bind to an appropriate class II MHC molecule by methods 

25 known in the art using, for example, isolated class II MHC 
molecules or cells transfected with nucleic acid molecules 
encoding class II MHC molecules- One such method is 
described in Example 2. These binding assays can also be 
used to test the ability of the peptides to bind to 

30 alternative class II MHC molecules, i.e., class II MHC 
molecules other than those from which they were eluted 
using the IMF method of the invention. Once such 
alternative class II MHC molecules are shown to bind the 
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peptide (s), the diagnostic methods of the invention (using 
such peptides) and therapeutic methods of the invention 
(using either the peptides or APL derived from them) can 
be applied to subjects expressing such alternative class 
5 II MHC molecules. 

1 . 3 Diseases and their associations with class II 
MHC genes 

The methods of the invention can be applied to the 
analysis of peptides involved in diseases associated with 

10 expression of defined class II MHC molecules and in which 
pathology or protection is due to the action of activated 
CD4''' T cells. Such diseases include, without limitation, 
certain infectious diseases, cancer, and autoimmune 
diseases. Three are discussed below, 

15 An example of an infectious disease that fulfills the 

above criteria is human leprosy, which is caused by 
MycoJbacteriujn leprae. The bacteria infect and thrive in 
peripheral Schwann cells and macrophages. The disease is 
characterized by depressed cellular immunity but normal 

20 antibody responses. Leprosy has been associated with the 
expression of DRBl class II MHC molecules in which codon 
13 encodes Arg or codons 70 and 71 encode Arg [Zerva et 
al. (1996), J. Exp. Med. 183: 829-836]. 

The ability to spontaneously clear hepatitis C virus 

25 is associated with expression of DQB1*0301 molecules. 

Since DQB1*0302 is under-represented in hepatitis virus C 
infected subjects, DQB1*0302 expressing individuals may be 
protected from infection with the virus [Cramp et al. 
(1998), J. Hepatol. 29:207-213]. 

30 Melanoma cell-specific CD4* T cells, which may be - 

involved in protective immune responses to malignant 
melanoma, recognize tyrosinase epitopes presented by HLA- 
DRB1*0401 class II molecules [Topalian et al. (1996), 
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supra] . With respect to cancers, the reticular cell 
sarcomas of SJL mice are dependent for growth on cytokines 
produced by activated CD4"^ T cells and require the 
expression of certain class II MHC molecules . 
5 Examples of autoimmune diseases to which the methods 

of the invention can be applied include, without 
limitation, IDDM, rheumatoid arthritis (RA) , multiple 
sclerosis (MS) , systemic lupus erythmatosus (SLE) , and 
myasthenia gravis (MG) . RA is associated with expression 

10 of DRBl alleles (DRB1*0101, 0401, 0403, and 0405) . MS is 
associated with expression of DRB1*1501, DQA1*0102 and 
DQB1*0602 alleles. SLE is associated with the expression 
of DRB1*03, DRB1*1501, DQA1*0501 and DQB1*0201 alleles. 
MG is associated with the expression of DR3 and DQ2 

15* {DQA1*0501-DQB1*0201 and DQA1*0201-DQB1*0201) alleles. 
Autoimmune ovarian failure is associated with DQBl genes 
encoding Asp at position 57. Graves" thyroiditis, 
Hashimoto's thyroiditis, and primary hypothyroidism all 
show weak association with the expression of the DR5 and 

20 DR3 alleles. Coeliac disease is associated with the 
expression of HLA-DQA1*0501 and DQB1*0201 alleles. 
Primary biliary cirrhosis is associated with the 
expression of DRB1*0801-DQA1*0401/0601-DQB1*04 alleles. 
Autoimmune hepatitis is associated with the expression of 

25 DRB3*0101 and DRB1*0401 alleles. Addison's disease is 
associated with the expression of DRB1*03, DQA1*0501 and 
DQB1*0201 alleles. Vitiligo is associated with the 
expression of DRB1*0701 and DQ2 alleles. Anti-glomerular 
basement membrane disease (Goodpasture's syndrome) is 

30 associated with the expression of DR15 and DR4 alleles. 

Pathology in RA, MS, and IDDM is considered to be due 
predominantly to CD4* T cell-dependent, cell-mediated 
autoimmune responses, while that of SLE and MG is due 
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predominantly to CD4* T cell-dependent, antibody-mediated 
autoimmune responses. In RA the inflammatory response 
induced by the activated CD4* T cells is focused on joint 
synovia, in MS on neural myelin sheaths, and in IDDM on 
5 pancreatic p cells located in the islets of Langerhans. 
SLE is a systemic autoimmune disease involving multiple 
organs. The muscle fatigue observed in MG is due to the 
development in the patient of antibodies that bind to the 
acetylcholine receptor in neuromuscular junctions. 
10 Other MHC class-II associated diseases are listed 

above . 

1.4 Species 

The methods of the invention can be applied to 
15 diseases with the described characteristics in a wide 
range of mammalian species, e.g., humans, non-human 
primates, horses, cattle, pigs, sheep, goats, dogs, cats, 
rabbits, guinea pigs, hamsters, rats, and mice. They will 
preferably be applied to diseases of humans. 

20 

1.5 Class II MHC Molecules 

Class II MHC molecules have been identified in 
multiple mammalian species. In some of these species, 
expression of a particular class II MHC molecule has been 
25 associated with a particular CD4"*' T cell-mediated diseases 
(see above) . In humans, for example, the class II MHC 
molecules are designated HLA-DR, HLA-DQ, and HLA-DP and in 
mice, H-2A and H-2E. In all species, there are multiple 
alleles of each gene. 

30 

1.6 Antigen Presenting Cells (APC) 

APC that can be used for the IMF methods of the 
invention can be any mammalian cell that expresses class 
II MHC molecules on its surface, e.g., those listed above 
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for use in EV (B lymphocytes, macrophages/ monocytes, 
dendritic cells, and, in humans, T cells) • APC can also 
be tumor cells, e.g., B cell lymphoma cells or melanoma 
cells. It is also not required that the cells 
5 constitutively express class II MHC molecules. Class II 
MHC can be induced (in vitro or in vivo) (e.g., by IFN-y) 
in such cells. Alternatively, immortalized lines of such 
cells can be used. 

10 1.7 Polypeptide Antigens 

Polypeptide antigens that can be used with the IMF 
methods can be those with a known amino acid sequence or 
those in which at least part of the amino acid sequence is 
known. They can be polypeptides that themselves are known 

15 or suspected to be involved in the disease process or 

immunity to the disease (e.g., CEA in cancer) or they can 
be derived from microbial organisms known or suspected to 
be involved in the disease process (e.g., M. leprae in 
leprosy) . Examples of other polypeptide antigens include 

20 the core and viral coat proteins of viruses such as 
hepatitis C virus, the heat shock proteins of 
mycobacteria, and tyrosinase in melanoma. Furthermore, 
the polypeptide antigen can be the full-length protein or 
it can be a fragment of the protein known or suspected to 

25 be involved in the disease process. 

2. Peptides 

Peptides of the invention include, but are not 
limited to, peptides that bind to class II MHC molecules 
30 and activate CD4* T cells involved in a disease process or 
protection from a disease. The class II MHC molecule can 
be a class II MHC molecule that is associated with 
susceptibility or resistance to a disease. Diseases can 
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be any of the diseases cited herein and the species from 
which the class II MHC molecules and/or peptides are 
obtained can be any of those cited herein. The class II 
MHC molecules are preferably human class II HLA molecules, 
5 i.e., DR, DP or DQ molecules. The peptides can be, for 
example, peptides that bind to DR4 or DRl molecules. The 
polypeptides from which the peptides of the invention are 
derived can be any of those cited herein. The peptides 
generally are 9 to 30 (e.g., 13 to 25) amino acids in 
10 length. 

The peptides can be derived, for example, from CEA 
and can bind to HLA-DRl or HLA-DR4 molecules. The 
peptides can be, for example, any one of the following 
peptides: 

15 KEVLLLVHNLPQH (SEQ ID N0:1); KEVLLLVHNLPQHL (SEQ ID NO: 2;) 
KEVLLLVHNLPQHLF (SEQ ID NO: 3); KEVLLLVHNLPQHL FG (SEQ ID 
NO: 4); GKEVLLLVHNLPQHL (SEQ ID NO: 5); EGECEVLLLVHNLPQHLFG 
(SEQ ID NO: 6); YLWWVNGQSLPVSPR (SEQ ID N0:7); 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13); NPPAQYSWLIDGNIQQH (SEQ ID 

20 NO: 14); or NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15), Also 
included are peptides containing any of the above 
sequences, plus 1-15 residues on either end or 1-15 
residues on each end. 

In addition, the peptides can be isolated fragments 

25 of CEA which are shorter than full-length CEA. In such 
peptides, the fragments can contain one or more (e.g., one 
two, three, four, five, six, seven, eight, nine, or ten) 
amino acid sequences which can be, for example, those 
amino acid sequences with SEQ ID N0S:l-7 and 13-15. 

30 The peptides can also be peptides that include one or 

more CEA segments (e.g., SEQ ID N0S:l-7 and 13-15) and one 
or more (e.g., one, two, three, four, five, six, seven, 
eight, nine, ten, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 
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21, 22, 23, 24, 25, 30, 35, 40, 50, 55, 60, 65, 70, 75, 
80, 90, 100, 150, 200, 250, 300, 400, 500, or more) amino 
acids at the C-terminus, the N-terminus, or both the C- 
terminus and the N-terminus of the CEA segment. The 
5 additional amino acid residues can be CEA or non-CEA amino 
acids. The peptide will, however, not be identical to 
full-length CEA. 

In addition, peptides identified as being associated 
with any of the diseases listed herein (e.g., cancer such 

10 as colon cancer or any of the infectious diseases recited 
herein) can be used to activate lymphocytes (e.g., CD4+ 
lymphocytes) that cause, directly or indirectly, the death 
of pathogenic target cells such as cancer cells or 
pathogen- infected cells. Alternatively, in other diseases 

15 such as the autoimmune diseases recited herein, the 

peptides can be used to induce immunological tolerance in 
lymphocytes (e.g., CD4+ T lymphocytes) associated with the 
initiation, progress, or pathological symptoms of the 
disease. Tolerization of these lymphocytes can be useful 

20 for prophylaxis against and/or therapy of the relevant 
disease. Lymphocyte activation or tolerization can be 
achieved by administering an appropriate peptide to a 
subject. Methods of testing for efficacy of a peptide in 
activating or in inducing tolerance in lymphocytes (e.g., 

25 CD4+ lymphocytes), methods and routes of administration, 
and doses to be administered are essentially the same as 
those described below for APL. Furthermore, the peptides 
can be modified in any of the ways described below for 
APL. In addition, peptidomimetic forms of the peptides 

30 can be produced by methods known in the art (see below in 
section on APL) . The peptides can be fragments of any the 
polypeptides disclosed herein, e.g., CEA, PSA, insulin, 
proinsulin, preproinsulin, GAD65, IA-2, or phogrin. They 
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can be, for example, those with SEQ ID N0S:l-7 and 13-15 
or any of those described above. 

The invention also includes a method of activating or 
tolerizing CD4^ T cells in which full-length CEA or DNA 
5 encoding full-length CEA is administered to a mammal. 

Following this administration, CD4**' T cells of the mammal 
are tested for responsiveness to a naturally processed 
fragment of CEA. Such a fragment can be, but is not 
limited to, a fragment with any of SEQ ID N0S:l-7 and 13- 
10 15. 

The CEA peptides of the invention {e.g., those with 
SEQ ID N0S:l-7 and 13-15) can be used for purposes other 
than therapy. They can be used, for example, in 
diagnostic assays and for methods of screening for 

15 reagents that bind to complexes of class II MHC molecules 
and CEA peptides (see below) . 

The peptides can be prepared using the described IMF 
methodologies. Smaller peptides (fewer than about 50 
amino acids long) can also be conveniently synthesized by 

20 standard chemical means. In addition, both polypeptides 
and peptides can be produced by standard in vitro 
recombinant DNA techniques, and in vivo transgenesis using 
the nucleotide sequences encoding the appropriate 
polypeptides, peptides or APL. Methods well known to 

25 those skilled in the art can be used to construct 

expression vectors containing relevant coding sequences 
and appropriate transcriptional/translational control 
signals- See , for example, the techniques described in 
Sambrook et al.. Molecular Cloning: A Laboratory Manual, 

30 Cold Spring Harbor Laboratory, N.Y., 1989, and Ausubel et 
al-. Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N-Y., 1989- 
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The invention also features isolated nucleic acid 
molecules encoding the peptides of the invention. These 
nucleic acid molecules can be cDNA, genomic DNA, synthetic 
DNA/ or RNA, and can be double-stranded or single-stranded 
5 (i.e., either a sense or an antisense strand). Segments 
of these molecules are also considered within the scope of 
the invention, and can be produced, for example, by the 
polymerase chain reaction (PGR) or generated by treatment 
with one or more restriction endonucleases . A ribonucleic 

10 acid (RNA) molecule can be produced by in vitro 

transcription. Preferably, the nucleic acid molecules 
encode peptides that, regardless of length, are soluble 
under normal physiological conditions. 

The nucleic acid molecules of the invention can 

15 contain naturally occurring sequences, or sequences that 
differ from those that occur naturally, but, due to the 
degeneracy of the genetic code, encode the same 
polypeptide (for example, the peptides with SEQ ID N0S:1~7 
and 13-15) . In addition, these nucleic acid molecules are 

20 not limited to coding sequences, e.g., they can include 
some or all of the non-coding sequences that lie upstream 
or downstream from a coding sequence. 

The nucleic acid molecules of the invention can be 
synthesized (for example, by phosphoramidite-based 

25 synthesis) or obtained from a biological cell, such as the 
cell of a mammal. Thus, the nucleic acids can be those of 
a human, non-human primate (e.g., monkey), mouse, rat, 
guinea pig, cow, sheep, horse, pig, rabbit, dog, or cat. 
In addition, the isolated nucleic acid molecules of 

30 the invention encompass segments that are not found as 
such in the natural state. Thus, the invention 
encoit^asses recombinant nucleic acid molecules (for 
example, isolated nucleic acid molecules encoding any of 
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the peptides described herein) incorporated into a vector 
(for example, a plasmid or viral vector) or into the 
genome of a heterologous cell {or the genome of a 
homologous cell, at a position other than the natural 
5 chromosomal location) . 

Techniques associated with detection or regulation of 
genes are well known to skilled artisans and such 
techniques can be used to diagnose and/or treat disorders 
associated with aberrant CEA expression, e.g., colon 

10 cancer. Hybridization can be used as a measure of 
homology between two nucleic acid sequences. Thus a 
nucleic acid encoding a peptide of the invention (e.g., a 
CEA peptide such as a peptide with SEQ ID NOs: 1-7 or IS- 
IS), or a portion of such a nucleic acid, can be used as 

15 hybridization probe according to standard hybridization 
techniques. The hybridization of a CEA peptide probe to 
DNA or RNA from a test source (e.g., a mammalian cell) is 
an indication of the presence of CEA DNA or RNA, 
respectively, in the test source. Hybridization 

20 conditions are known to those skilled in the art and can 
be found in Current Protocols in Molecular Biology, John 
Wiley & Sons, N.Y., 6.3.1-6.3.6, 1991. Moderate 
hybridization conditions are defined as equivalent to 
hybridization in 2X sodium chloride/sodium citrate (SSC) 

25 at 30^Cf followed by one or more washes in 1 X SSC, 0.1% 

SDS at 50-60*^C. Highly stringent conditions are defined as 
equivalent to hybridization in 6X sodium chloride/sodium 
citrate (SSC) at 45^C, followed by one or more washes in 
0.2 X SSC, 0.1% SDS at 50-65°C. 

30 The invention also encompasses: (a) vectors that 

contain any of the foregoing peptide coding sequences 
and/or their complements (that is, "antisense" sequence) ; 
(b) expression vectors that contain any of the foregoing 
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peptide coding sequences operatively associated with any 
transcriptional/translatiorial regulatory element (examples 
of which are given below in the section on APL) necessary 
to direct expression of the coding sequences; 
5 (c) expression vectors containing, in addition to 

sequences encoding a peptide of the invention, nucleic 
acid sequences that are unrelated to nucleic acid 
sequences encoding the peptide of the invention, such as 
nucleic acid sequences encoding a reporter, marker, or a 

10 signal peptide (e.g., a heterologous signal peptide); and 
(d) genetically engineered host cells that contain any of 
the foregoing expression vectors and thereby express the 
nucleic acid molecules of the invention. 

Where the nucleic acids form part of a hybrid gene 

15 encoding additional polypeptide sequences, for example, 
sequences that function as a marker or reporter, markers 
or reporter genes include p-lactamase, chloramphenicol 
acetyltransf erase (CAT), adenosine deaminase (ADA), 
aminoglycoside phosphotransferase (neo^, G418^) , 

20 dihydrofolate reductase (DHFR), hygromycin-B- 

phosphotransf erase (HPH) , thymidine kinase (TK) , lacZ 
(encoding p-galactosidase) , and xanthine guanine 
phosphoribosyltransf erase (XGPRT) . As with many of the 
standard procedures associated with the practice of the 

25 invention, skilled artisans will be aware of additional 
useful reagents, for example, additional sequences that 
can serve the function of a marker or reporter. 
Generally, the hybrid polypeptide will include a first 
portion and a second portion, the first portion being a 

30 peptide of the invention (e.g., a peptide with any of SEQ 
ID N0S:l-7 or 13-15) and the second portion being, for 
example, an immunoglobulin constant region. 



- 39 - 



wo 2002/022803 



PCTAJS2001/028467 



A variety of host-expression vector systems can be 
used to express the peptides and polypeptides. Such host- 
expression systems represent vehicles by which the 
polypeptides of interest can be produced and subsequently 
5 purified, but also represent cells that can, when 

transformed or transfected with the appropriate nucleotide 
coding sequences, produce the relevant peptide or 
polypeptide in situ. These include, but are not limited 
to, microorganisms such as bacteria, e.g., E. coli or 

10 B. subtlliSf transformed with recombinant bacteriophage 
DNA, plasmid or cosmid DNA expression vectors containing 
polypeptide coding sequences; yeast, e.g., Saccharomyces 
or Pichia, transformed with recombinant yeast expression 
vectors containing the appropriate coding sequences; 

15 insect cell systems infected with recombinant virus 

expression vectors, e.g., baculovirus; plant cell systems 
infected with recombinant virus expression vectors, e.g., 
cauliflower mosaic virus (CaMV) or tobacco mosaic virus 
(TMV) , or transformed with recombinant plasmid expression 

20 vectors, e.g., Ti plasmids, containing the appropriate 
coding sequences; or mammalian cell systems, e.g., COS, 
CHO, BHK, 293 or 3T3, harboring recombinant expression 
constructs containing promoters derived from the genome of 
mammalian cells, e.g., metallothionein promoter, or from 

25 mammalian viruses, e.g., the adenovirus late promoter or 
the vaccinia virus 7.5K promoter. 

3. APL 

An altered peptide ligand (APL) is a variant peptide 
30 in which 1-6 (i.e., 1, 2, 3, 4, 5, or 6) amino acid 

residues of a parent wild-type peptide that activates a 
response in CD4* T cells have been changed. In an APL of 
the invention, fewer than half of the residues of the 
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wild-type peptide are changed, e,g., fewer than 50%, fewer 
than 40%, fewer than 30%, fewer than 25%, fewer than 20%, 
fewer than 15%, fewer than 10%, or fewer than 5%. Thus, 
for example, in an APL derived from a wild- type peptide 
5 20 amino acids long and differing from the wild-type 
peptide at 6 positions, 30% of the amino acids of the 
wild- type peptide are changed. Alternatively, in an APL 
derived from a wild- type peptide 15 amino acids long and 
differing from the wild-type peptide at 3 positions, 20% 

10 of the amino acids of the wild-type peptide are changed. 

An APL retains at least some ability to bind to the 
class II MHC molecule to which the parent peptide binds 
and at least some ability to be recognized by the antigen- 
specific T lymphocyte receptor (s) of the CD4* T cell(s) 

15 that recognize the parent peptide bound to the Scime type 
of class II MHC molecule. However, by definition, the APL 
activates a response in the CD4'*' T cells that is 
qualitatively different from that activated by the parent 
peptide. For example, while the parent peptide can 

20 activate a helper T cell 1- (Thl-)type response in which 
the cytokines interleukin-2 (IL-2), interferon-y (IFN-y) , 
and tumor necrosis factor-a (TNF-a) are produced by the 
activated CD4* T cells, an APL derived from this parent 
peptide might instead activate a helper T cell 2- (Th2-) 

25 type response in the CD4* T cells. In a Th2 response, the 
cytokines interleukin-4 (IL-4), interleukin-5 (IL-5) , and 
interleukin-10 (IL-10) are produced by the activated CD4* 
T cells. Alternatively, if a particular parent peptide 
elicits a Th2 response in a given CD4* T cell, an APL 

30 derived from the parent peptide could activate a Thl 
response in the T cell. Some APL have been shown to 
switch a Thl response to a ThO response in which both Thl- 
and Th2-type cytokines are produced. Furthermore, an APL 
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could redirect a CD4"*" T cell response towards a Th3-type 
response in which the predominant cytokine produced is 
transforming growth factor-p (TGF-P) . TGF-p has been 
shown. to be suppressive of a wide range of immune 
5 responses. 

In general, Thl responses are associated with cell- 
mediated immune responses and Th2 responses are associated 
with antibody- (i.e., B cell-) mediated immune responses. 
Thus, the relative number of CD4''' T cells responding in a 
10 Thl- versus a Th2-type fashion will determine the nature 
(cell-mediated versus antibody-mediated) of the immune 
response generated by an antigen in a particular 
individual . 

Some conditions, and in particular autoimmune 
15 diseases (e.g., RA, IDDM, and MS), have been shown to be 
due to cellular immune responses and thus to be dependent 
on Thl CD4'*' T cell responses. Other diseases (e.g., MG and 
SLE) have been shown to be mediated by antibody (i.e., B- 
cell) responses, and thus to be dependent on Th2 CD4'*' T 
20 cell responses. Thus, an APL that serves to direct a 

T cell response from a Thl to a Th2 response can be useful 
in treatment or prevention of the first category of 
diseases and an APL that serves to alter a CD4'^ T cell 
response in a Th2 to Thl direction can be useful in the 
25 treatment or prevention of the second category of 
diseases. 

The amino acid substitutions in APL can be radical. 
For example, an amino acid with a positively charged side 
chain (e.g., lysine) can be replaced by an amino acid with 
30 a negatively charged side chain (e.g., aspartic acid) or a 
hydrophobic side chain (e.g., isoleucine) and vice versa. 
In addition, an amino acid with a bulky side chain (e.g., 
tryptophan) can be replaced with an amino acid with small 
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side chain (e.g., glycine or alanine) and vice versa. 
Alternatively, the substitutions can be conservative. For 
example, a negatively charged amino acid can be replaced 
with another negatively charged amino acid (e.g., aspartic 
5 acid with glutamic acid) or one hydrophobic amino acid 
with another hydrophobic amino acid (e.g., leucine with 
valine or isoleucine) . 

Methods to test whether a given APL elicits a 
predominantly Thl, Th2, Th3, ThO, or other CD4* T cell 

10 response are known in the art. In brief, an APL of 

interest can be administered to a test subject (e.g., a 
mouse) expressing a class II MHC molecule of interest 
(e.g., a human class II MHC molecule) by any one of a 
variety of routes, e.g., intramuscular, intravenous, 

15 subcutaneous, intradermal, intraperitoneal, intrarectal, 
intravaginal, intranasal, intragastric, intratracheal, or 
intrapulmonary . In addition, administration can be oral 
or transdermal, employing a penetrant such as a bile salt, 
a fusidic acid or another detergent. The injections can 

20 be single or multiple (e.g., 2-, 3-, 4-, 6~, 8-, or 10- 
fold) . The peptide can be administered in a 
physiologically acceptable solution (e.g., a saline 
solution) and can be administered with or without an 
adjuvant (e.g., Freund's complete or incomplete adjuvant 

25 or cholera toxin) . After immunization, the animal can be 
challenged with the APL, either in vivo or in vitro , by 
methods known in the art, and the levels of individual 
cytokines produced measured. In the case of an in vivo 
challenge, the cytokine secreted into the blood or some 

30 other bodily fluid (e.g., urine, saliva, or semen) or 
lavage (e.g., nasal, pulmonary, rectal, gastric, or 
vaginal lavages) can be measured. Alternatively, lymphoid 
cells can be isolated from the animal after challenge and 
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the level of cytokines produced by the cells can be 
tested, e.g., by culturing and measurement of cytokine 
levels in culture supernatant by, e.g., ELISA. Isolated 
lymphoid cells can also be tested for relative numbers of 
5 cells producing the cytokines by assays such as the 
ELISPOT assay or fluorescence analysis following 
intracellular staining with one or more cytokine binding 
antibodies, each conjugated with a different fluorophore 
which emits light of a distinct wavelength. Fluorophores 

10 fluorescing at different wavelengths (i.e., colors) are 
known in the art. Using such fluorescence assays, it is 
possible to ascertain the range of cytokines being 
produced by a single cell. If the lymphoid cells are 
challenged in vitro, assays such as the ELISPOT assay or 

15 ELISA can be used. Should immunization and challenge with 
an APL result in relatively low levels of IL-2, IFN-y, and 
TNF-a and relatively high levels of IL-4, IL-5, and IL-IO^ 
while immunization and challenge with the parental peptide 
results in the inverse pattern, the conclusion would be 

20 that the APL is useful for switching a response from a Thl 
to a Th2 pattern of cytokine production. Where the Thl 
response is pathogenic, treatment with the APL can be 
therapeutic or prophylactic. Similarly, APL and their 
parent peptides can be tested for their relative abilities 

25 to shift a response from a Th2 type response to a Thl type 
response, or from a Thl-type response to a ThO- or a Th3- 
type response. The ability of an APL to activate 
responses such as CD4* CTL responses, anti-angiogenic 
responses, or macrophage/eosinophil activating responses 

30 can be determined by methods familiar to those in the art. 
APL can also be tested by any of the protocols in 
human volunteers. Alternatively, lymphoid cells could be 
isolated from a subject (e.g., a human subject) and both 
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immunized and challenged in vitro. In addition, APL can 
be administered to "SCID-Hu" mice, which are mice 
genetically deficient in murine T and B lymphocytes and 
reconstituted with human lymphoid cells. Due to the 
5 inherent immunological deficiency in these animals, the 
human lymphoid cells are not rejected and will engraft. 
After immunizing and challenging these mice with an APL 
(using any of methodologies described above), their 
cytokine responses can be measured by any of the methods 

10 described above. To ensure that the cytokines detected in 
the assays are of human origin, human species-specific 
reagents (e.g., antibodies) can be used for the assays 
(e.g., ELISA or ELISPOT) . Furthermore, in order to 
exclude presentation of the APL to human CD4'*' T cells by 

15 murine class II MHC molecules, SCID mice could be bred 
with class II MHC "knockout" mice in order to generate 
mice deficient in both lymphocytes and murine class II MHC 
molecules. By reconstituting the resulting mice with 
human lymphoid cells, a SCID-Hu mouse is provided in which 

20 essentially the only CD4'*" T cells capable of responding are 
human CD4* T cells and the only class II MHC molecules 
capable of presenting the APL are human class II MHC 
molecules on the surface of human lymphoid cells. 
Alternatively, the recipient of human lymphoid cells could 

25 be a hybrid mouse derived by breeding SCID mice with DR 
(e.g., DR4) or DQ (e.g., DQ8) transgenic mice made on a 
class II MHC knockout background. Again the only class II 
MHC molecules present would be the human DR or DQ 
molecules contributed by the transgenic parental mice. 

30 ElAG-1 deficient mice can be used instead of the SCID 

mice for generation of the described human/mouse chimeric 
animals. RAG-1 deficient mice, like SCID mice, lack T and 
B lymphocytes but have the advantage that the relevant 
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mutation is not "leaky." Thus, while late in life SCID 
mice can develop a low number of lymphocytes/ this does 
not occur in RAG-1 deficient mice. 

The APL of the invention can be obtained by any of 
5 the methods described above for peptides and polypeptides. 
APL of the invention also include those described above, 
but modified for in vivo use by the addition, at either or 
both the amino- and carboxyl- terminal ends, of a blocking 
agent to facilitate survival of the relevant peptide in 

10 vivo. This can be useful in those situations in which the 
peptide temini tend to be degraded by proteases in vivo. 

Alternatively, blocking agents such as pyroglutamic 
acid or other molecules known in the art can be attached 
to the amino and/or carboxyl terminal residues, or the 

15 amino group at the amino terminus or carboxyl group at the 
carboxyl terminus can be replaced with a different moiety. 
Likewise, the peptides can be covalently or noncovalently 
coupled to pharmaceutically acceptable "carrier" proteins 
prior. to administration. 

20 Also of interest are peptidomimetic compounds that 

are designed based upon the amino acid sequences of the 
APL. Peptidomimetic compounds are synthetic, non-peptide 
compounds having a three-dimensional conformation (i.e., a 
"peptide motif") that is substantially the same as the 

25 three-dimensional conformation of a selected peptide. The 
peptide motif provides the peptidomimetic compound with 
the ability to activate CDA^ T cells in a manner 
qualitatively identical to that of the APL from which the 
peptidomimetic was derived. Peptidomimetic compounds can 

30 have additional characteristics that enhance their 

therapeutic utility, such as increased cell permeability 
and prolonged biological half -life. 
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The peptidomimetics typically have a backbone that is 
partially or completely non-peptide, but with side groups 
that are identical to the side groups of the amino acid 
residues that occur in the peptide on which the 
5 peptidomimetic is based. Several types of chemical bonds, 
e.g., ester, thioester, thioamide, retroamide, reduced 
carbonyl, dimethylene and ketomethylene bonds, are known 
in the art to be generally useful substitutes for peptide 
bonds in the construction of protease-resistant 
10 peptidomimetics. 

4. Methods of Therapy Using APL 

An APL that has the ability to elicit a cytokine 

response in CDA* T cells that is non-pathogenic and/or is 

15 suppressive of a pathogenic CD4"^ T cell cytokine response 

elicited by the APL's parental peptide, could be useful in 

therapy, palliation, or prophylaxis of a disease caused by 

the pathogenic CDi^ T lymphocyte response to the parental 

peptide. 

20 For example, if "peptide x". elicits a potent Thl-type 

response in CD4*^ T cells in a patient with a HLA-DR4, 
treatment of the patient with a peptide x-derived APL that 
elicits a Th2 CD4"*' T cell response can be therapeutic or 
palliative in that patient. Alternately, if peptide x 

25 elicits a Thl response, an APL derived from peptide x 
could elicit a CD4'*" T-cell response that enhances (a) 
migration of CDS'*' tumoricidal CTL to the vicinity of the 
tumor cells for identification and specific killing of the 
tumor cells; or (b) migration of tumoricidal macrophages 

30 and eosinophils to the vicinity of the tumor cells and 
thereby enhance killing of the tumor cells. 
Alternatively, an APL could elicit an anti-angiogeneic 
response in the microenvironment of the tumor that cuts 
off the nutrient supply needed for continued tumor cell 
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division and consequent tumor growth. Furthermore, an APL 
derived from a tumor antigen peptide could activate a CDA^ 
CTL response more effectively than the parent peptide from 
which the APL was derived. 
5 These methods of the invention fall into 2 basic 

classes, i.e., those using in vivo approaches and those 
using ex vivo approaches. 

4.1 In Vivo Approaches 

10 In one in vivo approach, the APL (peptide or 

peptidomimetic) itself is administered to the subject by 
any of the routes listed above. It is preferably 
delivered directly to an appropriate lymphoid tissue (e.g. 
spleen, lymph node, or mucosal-associated lymphoid tissue 

15 (MALT) ) . 

The dosage required depends on the choice of APL, 
the route of administration, the nature of the 
formulation, the nature of the patient's illness, and the 
judgment of the attending physician- Suitable dosages are 

20 in the range of 0.1-100.0 }xg/kg. Wide variations in the 
needed dosage are to be expected in view of the variety of 
APL available and the differing efficiencies of various 
routes of administration. For example, oral 
administration would be expected to require higher dosages 

25 than administration by i.v. injection. Variations in 
these dosage levels can be adjusted using standard 
empirical routines for optimization, as is well understood 
in the art. 

Alternatively, a polynucleotide containing a 
30 "minigene" encoding the APL can be delivered to an 
appropriate cell of the animal. Expression of the 
minigene will preferably be directed to lymphoid tissue of 
the subject by, for example, delivery of the 
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polynucleotide to the lymphoid tissue. This can be 
achieved by, for example, the use of a polymeric, 
biodegradable microsphere, microparticle or microcapsule 
delivery vehicle, sized to optimize phagocytosis by 
5 phagocytic cells such as macrophages. For example, PLGA 
(poly-lacto-co-glycolide) microparticles approximately 1- 
10 pm in diameter can be used. The polynucleotide is 
encapsulated in these microparticles, which are taken up 
by macrophages and gradually biodegraded within the cell, 
10 thereby releasing the polynucleotide. Once released, the 
DNA is expressed within the cell. A second type of 
microparticle is intended not to be taken up directly by 
cells, but rather to serve primarily as a slow-release 
reservoir of nucleic acid that is taken up by cells only 
15 upon release from the micro-particle through 

biodegradation. These polymeric particles should 
therefore be large enough to preclude phagocytosis (i.e., 
larger than 5pm and preferably larger than 20pm) . 
Microparticles useful for nucleic acid delivery, methods 
20 for making them, and methods of use are described in 

greater detail in U.S. Patent No. 5,783,567, incorporated 
herein by reference in its entirety. Microparticles may 
also be made, for example, by the methods of Mathiowitz et 
al. (WO 95/2429), incorporated herein by reference in its 
25 entirety. 

Alternatively, a polynucleotide containing a 
"minigene" encoding the APL can be delivered in a 
pharmaceutically acceptable carrier such as saline, 
lipids, liposomes, particulates, virus-like particles, or 
30 nanospheres; as colloidal suspensions; or as powders. The 
nucleic acid can be naked or associated or complexed with 
a delivery vehicle. For a description of the use of naked 
DNA, see, e.g., U.S. Patent No. 5,693,622. Nucleic acids 
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and polypeptides can be delivered using delivery vehicles 
known in the art, such as ISCOMS, microspheres, 
microcapsules, microparticles, gold particles, virus-like 
particles, nanoparticles, polymers, condensing agents, 
5 polysaccharides, polyamino acids, dendrimers, saponins, 
adsorption enhancing materials, or fatty acids. Viral 
particles can also be used, e.g., retroviruses, 
adenovirus, adeno-associated virus, pox viruses, SV40 
virus, alpha virus or herpes viruses. 

10 The vectors can be incorporated alone into these 

delivery vehicles or co-incorporated with tissue-specific 
antibodies. Alternatively, one can prepare a molecular 
conjugate composed of a plasmid or other vector attached 
to poly-L-lysine by electrostatic or covalent forces. 

15 Poly-L-lysine binds to a ligand that can bind to a 

receptor on target cells [Cristiano et al. (1995), J. Mol. 
Med. 73:479]. Alternatively, lymphoid tissue specific 
targeting can be achieved by the use of lymphoid tissue- 
specific transcriptional regulatory elements (TRE) such as 

20 a B lymphocyte, T lymphocyte, or, optimally, dendritic 

cell specific TRE. Lymphoid tissue specific TRE are known 
[Thompson et al. (1992), Mol. Cell. Biol. 12:1043-1053; 
Todd et al. (1993), J. Exp. Med. 177:1663-1674; Penix et 
al. (1993), J. Exp. Med, 178:1483-1496]. 

25 The nucleic acid sequence encoding an APL of interest 

with an initiator methionine and optionally a trafficking 
sequence is operatively linked to a promoter or enhancer- 
promoter combination in the expression vector. 

Short amino acid sequences can act as signals to 

30 target proteins to specific intracellular compartments. 
For example, hydrophobic signal peptides 
(e.g., MAISGVPVLGFFIIAVLMSAQESWA, SEQ ID N0:8) are 
typically found at the amino terminus of proteins destined 
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for the ER. The sequence KFERQ (SEQ ID NO: 9) (and other 
closely related sequences) is known to target 
intracellular polypeptides to lysosomes, while other 
sequences 

5 (e.g., MDDQRDLISNNEQLP, SEQ ID NO:10) target polypeptides 
to endosomes. In addition, the peptide sequence KDEL (SEQ 
ID NO: 11) has been shown to act as a retention signal for 
the ER. Each of these signal peptides, or a combination 
thereof, can be used to traffic the APL of the invention 

10 as desired- For example, a construct encoding a given APL 
linked to an ER-targeting signal peptide would direct the 
peptide to the ER, where it would bind to the class II MHC 
molecule as it is assembled, preventing the binding of 
intact invariant chain (li) which is essential for 

15 trafficking. Alternatively, a construct can be made in 

which an ER retention signal on the APL would help prevent 
the class II MHC molecule from ever leaving the ER. If 
instead an APL of the invention is targeted to the 
endosomic compartment, this would ensure that large 

20 quantities of the APL are present at the site of class II 
MHC to peptide coraplexing, thereby increasing the 
likelihood that the peptide incorporated into the class II 
MHC complex is the APL of the invention rather than a 
naturally-occurring, irrelevant peptide. The likelihood 

25 of APL being available for incorporation into class II MHC 
can be increased by linking the APL to an intact li 
polypeptide sequence. Since li is known to traffic 
class II MHC molecules to the endosomes, the hybrid li 
would carry one or more copies of the APL along with the 

30 class II MHC molecule; once in the endosome, the hybrid li 
would be degraded by normal endosomal processes to yield 
both multiple copies of the APL (or peptides similar to 
it), and an open class II MHC peptide binding cleft. DNAs 
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encoding APL linked to targeting signals can be generated 
by PGR or other standard genetic engineering or synthetic 
techniques. Trafficking sequences are described in 
greater detail in U.S. Patent No, 5,827,516 incorporated 
5 herein by reference in its entirety. 

A promoter is a TRE composed of a region of a DNA 
molecule, typically located within 100 nucleotide pairs 
upstream of the point at which transcription starts. 
Enhancers provide expression specificity in terms of time, 

10 location, and level. Unlike a promoter, an enhancer can 
function when located at variable distances from the 
transcription initiation site, provided a promoter is 
present. An enhancer can also be located downstream of 
the transcription initiation site. The coding sequence of 

15 the expression vector is operatively linked to a 
transcription terminating region. 

Suitable expression vectors include plasmids and 
viral vectors such as herpes viruses, retroviruses, 
vaccinia viruses, attenuated vaccinia viruses, canary pox 

20 viruses, adenoviruses and adeno-associated viruses, among 
others. 

Polynucleotides can be administered in a 
pharmaceutically acceptable carrier. Pharmaceutically 
acceptable carriers are biologically compatible vehicles 

25 which are suitable for administration to a human, e.g., 
physiological saline. A therapeutically effective amount 
is an. amount of the polynucleotide which is capable of 
producing a medically desirable result in a treated 
animal. As is well known in the medical arts, the dosage 

30 for any one patient depends upon many factors, including 
the patient's size, body surface area, age, the particular 
compound to be administered, sex, time and route of 
administration, general health, and other drugs being 
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administered concurrently* Dosages will vary, but a 
preferred dosage for administration of polynucleotide is 
from approximately 10^ to lO" copies of the polynucleotide 
molecule. This dose can be repeatedly administered, as 
5 needed- Routes of administration can be any of those 
listed above. 

4 . 2 Ex Vivo Approaches 

In one ex vivo approach, lymphoid cells, including 

10 CD4"** T lymphocytes, are isolated from the subject and 
exposed to the APL in vitro. The lymphoid cells can be 
exposed once or multiply (e.g., 2, 3, 4, 6, 8, or 10 
times). The pattern of cytokine production by the 
lymphoid cells can be tested after one or more exposures . 

15 Once the desired cytokines are being produced by the 

lymphoid cells, they are reintroduced into the subject via 
any of the routes listed herein. The therapeutic or 
prophylactic efficacy of this ex vivo approach is 
dependent on the ability of the ex vivo APL-activated 

20 lymphocytes to actively suppress a pathogenic 004* T cell 
response to the parental wild-type peptide. The potential 
value of such an approach is indicated by experiments in 
which 004"^ T cells producing Th2- (or ThO- or Th3-)type 
cytokines actively suppressed ongoing Thl responses and 

25 disease caused by such Thl responses [Nicholson and 
Kuchroo (1996), Curr. Opinion in Immunol. 8:837-842]. 

An alternative ex vivo strategy can involve 
transfecting or transducing cells obtained from the 
subject with a polynucleotide containing the APL-encoding 

30 minigenes described above. The transfected or transduced 
cells are then returned to the subject. While such cells 
would preferably be lymphoid cells, they could also be any 
of a wide range of types including, without limitation. 
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fibroblasts/ bone marrow cells, macrophages, monocytes, 
dendritic cells, epithelial cells, endothelial cells, 
keratinocytes, or muscle cells in which they act as a 
source of the APL for as long as they survive in the 
5 subject. The use of lymphoid cells would be particularly 
advantageous in that such cells would be expected to home 
to lymphoid tissue (e.g., lymph nodes or spleen), and thus 
the APL would be produced in high concentration at the 
site where they exert their effect, i.e., activation of an 

10 immune response. By using this approach, active in vivo 
immunization with the APL is achieved. The same genetic 
constructs and trafficking sequences described for the in 
vivo approach can be used for this ex vivo strategy. 

The ex vivo methods include the steps of harvesting 

15 cells from a subject, culturing the cells, transducing 
them with an expression vector, and maintaining the cells 
under conditions suitable for expression of the APL. 
These methods are known in the art of molecular biology. 
The transduction step is accomplished by any standard 

20 means used for ex vivo gene therapy, including calcium 
phosphate, lipofection, electroporation, viral infection, 
and biolistic gene transfer. Alternatively, liposomes or 
polymeric microparticles can be used. Cells that have 
been successfully transduced are then selected, for 

25 example, for expression of the minigene or of a drug 
resistance gene. The cells may then be lethally 
irradiated (if desired) and injected or implanted into the 
patient. 

These methods of the invention can be applied to any 
30 of the diseases and species listed here. Methods to test 
whether an APL is therapeutic for or prophylactic against 
a particular disease can be simple modifications of the 
above-described methods for establishing the type of CD4'*' 
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T lymphocyte response elicited by a particular APL. Where 
a therapeutic effect is being tested, a test population 
displaying syit^toms of the disease (e.g., cancer patients) 
is treated with a test APL, using any of the above 
described strategies. A control population, also 
displaying symptoms of the disease, is treated, using the 
same methodology, with a placebo. Disappearance or a 
decrease of the disease symptoms in the test subject would 
indicate that the APL was an effective therapeutic agent - 

By applying the same strategies to subjects prior to 
onset of disease symptoms (e.g., experimental animals in 
which an appropriate disease can be deliberately induced, 
e.g., tumor model), APL can be tested for efficacy as 
prophylactic agents, i.e., vaccines. In this situation, 
prevention of onset of disease symptoms is tested. 

The following examples are meant to illustrate, not 
limit, the invention. 

Example 1, Identification of HLA Class II Binding CEA 

Peptide Epitopes 

Materials and Methods 

Epstein-Barr Virus (EBV)- Transformed B cell Lines 
expressing CEA encoding cDNA . EBV- transformed B 
lymphocyte lines, JY and A2.140, were propagated in 
RPMI 1640 medium supplemented with glut amine, 
penicillin/streptomycin and 10% fetal calf serum (FCS) . 
CEA-encoding cDNA was inserted into the expression vector 
pcDNA3.1-Zeo (Invitrogen, Carlsbad, CA) at the EcoR V and 
Xba I restriction sites to give pcDNA3 . 1-CEA. pcDNA3.1- 
Zeo has a CMV promoter and a zeomycin selection marker. 
The sequence of the CEA-encoding cDNA (SEQ ID NO: 12) 
inserted into the vector is shown in Fig. lA and the amino 
acid sequence (SEQ ID NO: 16) of CEA encoded by the cDNA is 
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shown in Fig. IB. pcDNA3.1-CEA and pcDNAS.l (a negative 
control vector which was pcDNA3.1-Zeo without a protein- 
encoding insert) were linearized and both transfected by 
electroporation into JY and A2,140 cells. Stably 
5 transfected cells were selected with Zeonycin (50 |ig/ml; 
sold as Zeocin™) and the transfected cells were cloned by 
limiting dilution. All surviving clones were then tested 
for CEA expression using the murine mAb COL-1 (specific 
for human CEA) by western blot and FACS analysis. Three 

10 high-expressing CEA clones were selected for further 
analysis {JY-CEA-9, A2. 140-CEA-15 and A2. 140--CEA-38) . 
Fig. 2 shows the cell surface expression of CEA on JY-CEA- 
9 cells as detected by FACS analysis. 

Culture of EBV-Trans formed B Cell Lines . EBV- 

15 transformed B lymphocyte lines were propagated in 
RPMI 1640 medium supplemented with glutamine, 
penicillin/streptomycin and 10% fetal calf serum (FCS) in 
2 L roller bottles to a density of -10^ cells/mL. JY cells 
and transfectants produced using them (e.g., JY-CEA-9) 

20 have the HLA-DRB*0401/HLA-DRB1*1301 genotype. A2.140 and 
transfectants produced using them (e.g., A2 . 140-CEA-15 and 
A2.140-CEA-38) have the HLA-DRB1*0101 genotype. 

HLA class II purification . Cells were harvested and 
25 pelleted by centrifugation. The cell pellets were weighed 
to determine the cellular mass and then frozen at -80°C 
prior to lysis. Each cell pellet was resuspended in lysis 
buffer (2 ml per gram of pellet) containing 1% CHAPS, 500 
mM NaCl, 20 mM Tris-OH pH 8.0, in Milli-Q™ reverse osmosis 
30 quality (about 18.2 \xQ) water containing freshly added 
protease inhibitors (100 (iM iodoacet amide, 8 |ig/ml 
aprotinin, 10 jig/mL leupeptin, 10 jig/mL pepstatin A, 5 mM 
EDTA, 0.04 % sodium azide, 1 mM PMSF) , and the cells were 
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lysed by gentle agitation for 1 hr at 4^C on a rotor 
table. The resulting cell lysate was sedimented by ultra- 
centrifuge for 1 hr at 230,000 x g at 4^C (37,500 RPM on a 
SW41 TI rotor) . The insoluble material was removed by 
5 sedimentation at 175,000 x g for 2 hours, and the soluble 
supernatant fraction used for subsequent HLA purification. 
Multi-modal protein purification using HPLC columns was 
achieved by coupling the chromatographic sorbents in 
series with automated switching valves that direct the 

10 class II HLA-peptide complex containing effluent to 
subsequent columns in the sequences- The first three 
coupled columns were connected directly in series and 
acted together as a single pre-clearing column using high 
strength large throughpore perfusion sorbents (6000-8000 A 

15 throughpores and 500-1000 A diffusive pores, 50 lam) coated 
and crosslinked with a hydrophilic stationary phase and 
covalently conjugated with Protein A as the sorbent. 
These columns were designed to remove those proteins that 
non-specif ically bind to the sorbents. Column 1 contained 

20 unmodified Protein A sorbent, column 2 contained Protein A 
coated with normal mouse serum, and column 3 was Protein A 
coated with bovine serum. The pre-clearing columns were 
followed by an immunoaf f inity column of Protein A coupled 
with mAb specific for a non-polymorphic determinant on 

25 HLA-DR molecules (A-LB3.1). After passing the lysate 
through the immunoaff inity column, the column was 
extensively washed with 50 column volumes of 0.1 % 
CHAPS/500 mM NaCl/0.05 % sodium azide/20 mM Tris-OH pH 
8.0, followed by 50 column volumes of 0.1 % DOC/20 mM 

30 Mops/280 mM NaCl/0.05 % sodium azide pH 8.0, and finally 
100 colimn volumes of 0.1 % DOC/0.05 % sodium azide/10 mM 
Tris-OH pH 8.0. The HLA-DR-peptide complexes were eluted 
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from the immunoaf finity support using 3.5 column volumes 
of 50 mM carbonate/0.1% DOC/0,05% NaNa at pH 11.5. 

Peptide Analysis 
5 The HLA class II protein samples can be concentrated 

by various methods. In one such method, the HLA class II 
protein sample eluted from the immunoactivity column was 
concentrated to 100 jxl using an ultrafiltration device 
(Amicon Centficon 10) prior to peptide extraction. 

10 Naturally processed peptide mixtures were acid eluted from 
HLA class II molecules by adding 800 nl 10 % acetic acid 
and incubating for 15 minutes at 7 0*^0, as previously 
described [Chicz et al. (1993), J. Exp. Med. 178:24-47]. 
The peptides were separated from the remaining HLA protein 

15 by ultrafiltration with an Amicon Centricon 10™ device. 
The "flow- through" fraction containing the acid-extracted 
peptides was concentrated on a Savant SpeedVac™ 
centrifugal vacuum concentrator to a vol\ame of 
approximately 20-30 \il and stored at -80**C. The acid- 

20 extracted peptide mixtures were then separated by reverse 
phase chromatography as previously described [Chicz et al. 
(1993), supra] but with minor modifications. Briefly, 
peptide solutions were preconcentrated by trapping the 
peptides using a small bed (0,5 - 3.0 |J.L bed volume) of 

25 polymeric reversed phase support. This also facilitates 
removal of hydrophilic contaminants that may be in the 
sample by washing the trap with a suitable aqueous 
solution (e.g., the buffers used for the chromatographic 
separation of the isolated peptides) . Subsequently, 

30 peptides were back-flushed from the trapping phase to a 
microbore C18 coliamn (1.0 x 250 mm; Vydac, Hesperia, CA) , 
and peptide fractionation was performed at a flow rate of 
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50 fil /minute. The column effluent was collected and 
stored at -20 °C prior to analysis by mass spectrometry. 

An alternative approach was to concentrate H3aA 
class II molecule samples and extract the associated 
5 peptides using solid phase extraction (SPE) . In this 
method, the HLA class II molecule samples were 
concentrated using a protein capture column that contained 
a suitable HPLC stationary phase. Appropriate HPLC 
stationary phases include a reversed phase (preferably a 

10 C18 or a polymeric CIS-like reversed phase) . However^ a 
cation or anion exchange resin (including both strong, 
weak and mixed bed ion exchange phases) or other 
substrates that exhibit high affinity for protein 
complexes while enabling the elution of the HLA class II 

15 peptides, are equally suitable. Suitable dimensions of 
the protein capture column are 1-10 mm internal diameter 
and 2-10 cm long. Once the protein was bound to the 
capture column, the column was washed with suitable 
solvents (e.g., an aqueous buffer containing 5-25 mM Tris 

20 base at pH 7.5-8.5 followed by an aqueous solution of 

trif luroacetic acid (0.05-0.2% v/v) ) to remove hydrophilic 
contaminants. The HLA class II molecule-peptide complexes 
were next disrupted by the action of a suitable solvent 
that also facilitated the elution of peptides from the 

25 protein capture column. An appropriate solvent for this 
purpose is a mixture of acetonitrile (5-25% v/v), TFA 
(0.05-5% v/v) in water. The eiuted peptides were 
collected in a suitable container (e.g., an Eppendorf"" 
microfuge tube) and concentrated using a Savant Speedvac^ 

30 centrifugal vacuum concentrator. This step lowered the 
acetonitrile concentration and thereby permitted peptide 
fractionation by reverse phase chromatography (as 
described above) . All peptide samples were stored at - 
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20°C prior to analysis by mass spectrometry. 
Alternatively, the eluted peptides were mixed post-elution 
from the protein capture column with an aqueous solution 
of TFA (typically 0.05-0,2% TFA in water) in order to 
5 lower the acetonitrile concentration and thereby permit 
peptide adsorption onto a peptide capture column. The 
peptide capture column contained an HPLC phase (e.g., a 
reversed phase resin such as CI 8 or polymeric equivalent 
and an ion exchange resin or other suitable phase that 

10 exhibits high affinity towards peptides) • Suitable 

dimensions of the peptide capture column are 0.5-4 ,6 mm 
internal diameter and 1-10 cm long. Following the 
adsorption of peptides, the peptide capture column was 
washed with a suitable solvent (e.g., 0-5% acetonitrile, 

15 0.05-0,2% TFA in water) prior to peptide fractionation by 
reversed phase chromatography {as described above) . All 
peptide samples were stored at -20 °C prior to analysis by 
mass spectrometry. 

Peptide samples were prepared for matrix assisted 

20 laser desorption time-of-f light (MALDI-TOF) mass 

spectrometry analysis by adding 1.0 iiiL of each sample to a 
spot on the sample plate. A matrix solution (0,5 |nL of a 
solution of a-cyano-4-hydroxycinnamic acid (10 mg/ml) in 
50% acetonitrile/0.1 % trif luoroacetic acid) was added to 

25 the dry sample spots and allowed to air dry. The matrix 
solution also contained des-arg bradykinin and glu- 
fibrinopeptide B (300 fmol/fxL of each) of each as internal 
mass scale calibrants. Mass spectra were obtained at 
optimum laser intensities by averaging the ion signals 

30 from 50 to 256 individual scans in both linear and 
reflector modes using a single stage extended length 
reflector time-of-f light mass spectrometer (Voyager Elite 
XL; PerSeptive Biosystems, Framingham, MA) . 
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An automated microcapillary liquid chromatography- 
mass spectroscopy (LC-MS) approach with either targeted or 
data dependent collision-assisted dissociation (CAD) was 
used to sequence low levels of naturally processed HLA 
5 associated peptides that were isolated by the MALDI-TOF-MS 
approach previously described. Peptide fractions 
separated by reversed phase chromatography were diluted to 
a final volume of 5-20 jil to aid handling and permit 
second dimension reversed phase separations. Each diluted 

10 peptide solution was then concentrated by trapping 
peptides in a small bed (0.5 - 1.0 bed volume) of 
polymeric reversed phase support. This step also 
facilitated removal of hydrophilic contaminants by washing 
the trap with a suitable aqueous solution (e.g., mobile 

15 phase A as used in the chromatographic separation of 

isolated peptides) . The peptides were then back flushed 
from the trapping phase onto the microcapillary (with an 
inner diameter of 75 \m and packed with 3-10 cm of 1-7 jxm 
100-200 A Ci8 or non-porous material) and separation was 

20 developed using a non-linear gradient of conventional 
mobile phases for peptide separations (typically 
combinations of water and acetonitrile containing a 
suitable ion pair reagent) , A mobile phase flow rate of 
0.15-0-20 |j.L/min was achieved by splitting the flow from 

25 the pumps and using a backpressure regulator. Peptide 
detection was by |Li-electrospray mass spectrometry. The 
voltage necessary to drive the electrospray was applied at 
the head of the microcapillary column and peptides were 
electrosprayed into the mass analyzer directly as they 

30 eluted from the column. CAD experiments were either 
predetermined to conduct specific target analyses or 
triggered in a data dependent mode, using ions that were 
more abundant than a user-set threshold. Dynamic 
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exclusion was used in conjunction with data dependent 
analyses to ensure maximum peptide coverage (i.e., minor 
responses were analyzed by CAD following a user-determined 
number of CAD experiments of a single peptide response) by 
5 writing an exclusion list during assay progression so that 
a given ion will not be analyzed by multiple CAD 
experiments. The time that a given ion resides on the 
exclusion list was dependent upon the quality of the 
chromatographic separations. This time is determined 
10 experimentally- In this way, separated isobaric responses 
may be analyzed. Peptide sequencing sensitivities greater 
than 100 attomoles were achieved using this method. 

Results 

15 A cDNA sequence encoding the tumor antigen CEA (SEQ 

ID NO: 12) was cloned into a mammalian expression vector 
(pcDNA3, 1-Zeo) and was transfected into and expressed in 
JY EBV-transformed B lymphocytes and A2.140 EBV- 
trans formed B lymphocytes using the methods described 

20 above. Transfectants were selected using Zeocin and the 
resulting stable transfectants were cloned by limiting 
dilution. Six control clones stably transfected with the 
pcDNA3.1-Zeo vector without an expressible insert 
("parental vector") and 43 CEA expressing transfectant 

25 clones were generated with JY cells. Six control clones 
stably transfected with the parental vector and 7 CEA 
expressing transfectant clones were generated with A2.140 
cells. All clones were then tested for CEA expression by 
western blot and FACS analysis using the murine mAb Col-1, 

30 Three CEA high- expressing clones (one from JY cells and 
two from A2.140 cells) were selected for further analysis 
(JY-CEA-9, A2.140-CEA-15 and A2 . 140-CEA-38) . Two control 
clones, one each from JY (clone JY-pcDNA3-lG9) and A2.140 
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cells (clone A2 . 140-pcDNA3) were also selected for peptide 
repertoire comparison. 

After growing the cells to a density of ~10^ cells/inL, 
each preparation was harvested by centrifugation. JY- 
5 pcDNA3-lG9 was harvested from 10.8 L of medivim to yield 
24.0 grams of cell pellet. JY-CEA-9 was harvested from 
10.8 L of medium to yield 23.6 grams of cell pellet. 
A2.140-pcDNA3 was harvested from 20 L of raediiim to yield a 
41.0 gram pellet. A2. 140-CEA-15 was harvested from 14.4 L 
10 of medium to yield a 33.0 gram pellet. A2. 140-CEA-38 was 
harvested from 14,4 L of medium to yield a 30.2 gram 
pellet. 

Protein purification was accomplished by 
immunoaf f inity chromatography as described above. HLA-DR 

15 molecules purified from JY are heterogenous and include 
both HLA-DR4 and HLA-DR13 molecules. The JY-pcDNA3-lG9 
HLA-DR yield from the Protein- A-LB3 . 1 column was 5.8 mg. 
The JY--CEA-9 HLA--DR yield from the Protein-A-LB3 . 1 column 
was 5.0 mg. The only HLA~DR molecules A2.140 cells 

20 express are HLA-DRl molecules. The A2.140-pcDNA3 HLA-DRl 
yield from the Protein-A-LB3 , 1 column was 12.0 mg. The 
A2-140-CEA-15 HLA-DRl yield from the Protein-A-LB3 . 1 
column was 12.5 mg. The A2. 140-CEA-38 HLA-DRl yield from 
the Protein-A-LB3. 1 column was 8.0 mg. 

25 Peptides were eluted from each HLA-DR protein 

preparation and separated by RP-HPLC, and each of the 
100 fractions collected was analyzed by MALDI-TOF. 
RP-HPLC analysis was highly reproducible, with 
chromatographic traces from the CEA-expressing and control 

30 CEA-non-expressing HLA-DR4/13 preparations showing 

similarity to each other. Similarly, the traces obtained 
with peptides from CEA-expressing and control CEA non- 
expressing HLA-DRl preparations showed similarity to each 
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Other, The chromatographic traces of the peptide 
repertoires from HLA-DR4/13-expressing JY cells and HLA- 
DRl-expressing A2.140 cells were distinguishable from each 
other. A subtractive approach was used to identify CEA- 
5 derived peptides. Mass spectra of equivalent RP-HPLC 
fractions from the A2 . 140~pcDNA3 and A2. 140-CEA-38 
peptides preparations were overlaid and masses common to 
both were discounted from further analysis. An example of 
such a profile is shown in Fig. 3. In Fig. 3, while peaks 

10 with m/z values of about 1704.8, 1805.9 and 1899.9 were 
seen in the spectra obtained with peptide mixtures from 
both A2.140-CEA--38 and A2 . 140-pcDNA3 cells, a peak with a 
m/z value of 1801.9 was seen only in the spectrum obtained 
with the peptide mixture from A2 . 140-CEA-38 cells. 

15 Similar MALDI-TOF analyses were performed for the peptide 
repertoires isolated from JY-CEA-9 and JY-pcDNA3-lG9 
control cell lines as well as A2 • 140-CEA-15 and A2.140- 
pcDNA3 cell line control. 

Of the approximately 40,000 m/z values observed, 

20 165 novel masses were initially identified as potential 
naturally processed peptides from CEA. Subsequent mass 
analyses using MS/MS fragmentation were performed on 278 
of the 500 RP-HPLC fractions as described. Approximately 
50,000 MS/MS experiments were performed in both data 

25 dependent and targeted mass analyses. Six CEA peptides 
from the JY-CEA-9 cell line (SEQ ID N0S:l-6) were 
identified and sequenced (Figs. 4-9). One CEA-derived 
peptide was found in both the A2 . 140-CEA-15 and A2.140- 
CEA-38 cell lines (SEQ ID NO: 7) (Fig, 10). 

30 Thus, the described IMF method applied to the 

analysis of peptides produced by natural processing of CEA 
identified seven peptides that are associated with HLA- 
DR4/13 and HLA-DRl. This knowledge provides the basis for 
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the development of therapeutic and/or prophylactic agents 
against CEA associated cancers, e.g., colon cancer. It is 
expected that analogous methodologies can be similarly 
successful in identifying other class II MHC-restricted 
5 ttimor antigen peptides that activate 004"*" T cells and are 
involved in the CD4'*" T lymphocyte-mediated pathogenesis of 
other diseases (see above) in which susceptibility is 
linked to the expression of a particular class II MHC 
molecule . 

10 Three additional peptides (SEQ ID NOS: 13-15) from 

the JY-CEA-9 cell line were subsequently identified and 
sequenced (Figs. 10-12) . 

Epitope Verification (EV) : 

15 To confirm that peptide epitopes identified are 

relevant to cancer (i.e., that they are recognized by 004"*" 
T cells of patients with cancer) , T cell proliferation 
assays can be carried out using synthetic peptides having 
amino acid sequences based upon the sequence of the 

20 peptides identified by mass spectrometry to be derived 
from CEA (e.g., the peptides with SEQ ID N0S:l-7 and IS- 
IS) . Peptides can be synthesized using Fmoc chemistry and 
purified by RP-HPLC. The amino acid sequences and purity 
of greater than 90% for all the synthetic peptides can be 

25 confirmed by MALDI-MS and analytical HPLC. Peripheral 
blood mononuclear cells from recent onset cancer patients 
(and healthy controls expressing the appropriate HLA-DRl 
and HLA-DR4 molecules) can be separated by density 
gradient centrifugation and co-cultured in wells of 96- 

30 well U-bottom plates with peptides at a concentration of 
10 ng/ml for 5 days in 150 \xl RPMI 1640/10% pooled normal 
AB serum, followed by pulsing with 0.5 \iCi [^H]- 
thymidine/well and harvesting onto filters for 
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radioactivity counting measured in counts per minute 
(cpm) • There are generally twelve replicate wells per 
test group. Results are expressed as a stimulation index 
(SI) which is the ratio of the counts per minute (cpm) 
5 obtained in samples from cultures containing peptide to 
the cpm obtained in samples from cultures without peptide 
(mean cpm of 12 wells in each case) • The data are also 
analyzed in terms of the fraction of "positive culture 
wells." A positive culture well is one that contained 
10 peptide and resulted in cpm > mean cpm + 2SD obtained from 
cultures without peptide. T cell responses are considered 
significant when the SI is >2.0 and >40% wells are 
positive. 

Data obtained from the above EV analysis are expected 
15 to confirm that the peptides identified by the IMF method 
are recognized specifically by CD4'*' T lymphocytes from HLA- 
DRl and-HLA-DR4 expressing cancer patients and thus may be 
implicated in immunity to the cancer. 

20 Example 2, Binding of consensus peptides to isolated HLA- 
DRl and HLA-DR4 molecules 

Synthetic peptides with amino acid sequences based on 
the 3 core regions identified by the IMF can be tested for 

25 their ability to bind to isolated HLA-DRl and HLA-DR4 
molecules in binding inhibition assays performed 
essentially as previously described [Chicz et al, (1997), 
J. Immunol- 159: 4935-4942]. In brief , aliquots of 
immunopurified preparations of HLA-DRl and HLA-DR4 (final 

30 concentration of 10 jig/ml) are incubated with a 

biotinylated HLA-DR specific binding peptide (consisting 
of residues 98-117 of class II MHC invariant chain) ("the 
indicator peptide") (1 jxM) and varying concentrations of 
the test peptides in 0.2 ml tubes. After an overnight 



- 66 - 



wo 2002/022803 



PCTAJS2001/028467 



incubation at room temperature, the contents of each tube 
are transferred to a well of a 96-well plastic microtiter 
plate precoated with anti-HLA-DR antibody. The microtiter 
plates are rocked for 60 min at room temperature and 
5 unbound material is removed by rigorous washing. The 
relative amount of bound standard peptide in each well 
will be determined by measuring color development after 
addition of streptavidin-conjugated alkaline phosphatase, 
washing, and adding a chromogenic alkaline phosphatase 
10 substrate. 

Other Embodiments 

The invention also features the following 
embodiments, 

15 Methods of Use 

The peptides or APL of the invention can be used to 
activate CD4"^ memory T cells specific for CEA by in vitro 
methods known to those in the art. After a patient 
undergoes surgery to remove a tumor, such CEA-specific 004*** 

20 memory T cells may be administered to establish tumor- 
specific, long-lasting immunity against tumor rechallenge. 
Topalian [Current Opinion in Immunology (1994), 6:741-745] 
reviews the role of anti-tumor CD4'*' T cells in 
immunological memory. Studies have demonstrated that 

25 tumor-specific immunological memory can be established by 
vaccination with IL-2 -transduced tumor cells, and 004*^ T 
cells are a critical component of this memory response. 

The peptides or APL of the invention can also be used 
to help class I MHC restricted (CD8*^) CTL responses. For 

30 example, CDS'*" T cells recognizing the gag-L-encoded CTL 
epitope have been demonstrated to be effector cells that 
are efficiently activated with help from peptide- 
activated, tumor-specific CDA* T cells [Ossendorp et al. 
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(1998), J. Exp. Med. 187:693-702]. In addition, mature, 
monocyte-derived dendritic cells were used to elicit 
resistance to malignant melanoma. [Thurner et al. (1999), 
J. Exp. Med. 190:1669-1678]. Intracutaneous injections of 
5 Mage-3A1 peptide-pulsed mature dendritic cells enchanced 
Mage-3Al-specific CDS* CTL responses and activated CD4^ T 
cell activity in stage IV melanoma patients with large 
tumor loads. 

004"*" T cells liave been shown to play a role in the 

10 induction and persistence of CD8^ T cells. A peptide 

epitope h-gplOO (melanocyte differentiation antigen which 
is expressed by more than 75% of melanomas) that binds to 
several class II MHC molecules was identified and 
characterized. The MHC class Il-restricted epitope, which 

15 was capable of inducing tumor-reactive CD4'*' cells, could 
potentiate antitumor effector function and long term 
immunity [Touloukian et al. (2000), J. Immunol. 164:3535- 
3542] . The peptide was able to induce specifically the in 
vitro expansion of CD4'*" T cells. Thus, the peptides or APL 

20 of the invention could be used as immunogens, together 
with one or more CD8* T cell epitopes produced by 
appropriate CEA-expressing tiomor cells. Interestingly in 
this regard, the human Melan-A/MART-1 protein that 
includes an HLA-A2-restricted peptide epitope recognized 

25 by melanoma-reactive CDS* CTLs also contains at least one 
HLA-DR4 -presented peptide recognized by CD4* T cells 
[Zarour et al. (2000), Proc. Natl. Acad. Sci. USA 97:400- 
405] . 

The peptides or APL of the invention can also be used 
30 to activate cytokine production for the recruitment of 
turaoricidal eosinophils and macrophages. Hung et al. 
[(1998), J. Exp. Med. 188:2357-2368] showed that 
vaccination with irradiated tumor cells transduced to 
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secrete granulocyte/macrophage colony-stimulating factor 
resulted in activation of CD4'*' T cells which, by the action 
of the cytokines that they produce, activated eosinophils 
and macrophages to produce superoxide and nitric oxide, 
both which have tumoricidal activity. It will be possible 
to screen for a peptide or APL of the invention that will 
stimulate CD4* T cells with such activity. 

Qin et al. [(2000), Immunity 12: 677-686] showed that 
in vivo CD4* T cell mediated immunity to tumors was due to 
inhibition of ttimor-induced angiogenesis, was dependent on 
IFN-y, and required expression of the receptor for IFN-y 
on non-hematopoietic cells. Peptides or APL of the 
invention can be useful in activating CD4* T cells with 
this anti-tumor activity. 

Reagents that Bind to Peptide-Class II MHC Complexes 

Reagents that bind to peptide-class II MHC complexes 
can be made, for example, by screening a phage display 
library in which the phage particles contain nucleic acid 
20 sequences encoding antibody fragments such as Fab or 

single chain Fv (scFv) fragments. Such libraries can be 
screened by testing for the presence of and isolating 
phage particles with the ability to bind to the peptides 
of the invention (e.g., KEVLLLVHNLPQH (SEQ ID N0:1), 
25 KEVLLLVHNLPQHL (SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID 
NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GECEVLLLVHNLPQHL 
(SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID N0:6); 
YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGKEVLLLVHNLPQHL (SEQ ID 
NO: 13); NPPAQYSWLIDGNIQQH (SEQ ID NO: 14); or 
30 NPPAQYSWLIDGNIQQHT (SEQ ID N0:15)), bound to a class II 
MHC molecule of interest- For example, phage display 
technology has been used to identify antibodies specific 
for defined HLA-DR2 peptide complexes associated with 
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multiple sclerosis [Krogsfaard et al, (2000), J. Exp. Med. 
191:1395-1412]. Alternatively, polyclonal antibodies or 
mAb can be screened for their ability to bind to peptide- 
class II MHC complexes of interest. 
5 The above antibody-based reagents can be used, for 

example, in diagnosing cancer. For example, binding to a 
test cell of a Fab, scFv, or an antibody (e.g., a mAb) 
specific for a HLA-DR4 molecule bound to a CEA peptide of 
the invention would indicate that the test cell is a 
10 cancer cell. Examples of detection agents used to detect 
binding of antibodies or antibody fragments include, 
without limitation, enzymes, radiolabels, luminescent 
compounds, and fluorescing compounds that elicit a 
detectable and measurable signal when the antibody 
15 complexes with the CEA naturally processed peptide . 
Examples of detectors include, without limitation, 
spectrophotometers , colorimeters , f luorometer s , 
luminometers and biacore machines. 

Therapeutic agents for treating cancer can be made, 
20 for example, by linking one of the above reagents with a 
therapeutic (e.g., cytotoxic) atom or molecule. An 
appropriate therapeutic agent, after administration (by 
any of the methods disclosed herein) to a subject with the 
relevant cancer, binds to the cancer cells. The 
25 therapeutic atom or molecule can then kill the cancer 
cell. Alternatively, the therapeutic agent is 
internalized by the cancer cell and then the therapeutic 
atom or molecule kills the cancer cell. The linkage 
between the reagent and therapeutic atom or molecule can 
30 be a covalent one or a relatively weak non-covalent one 
such that the complex dissociates after binding to the 
cancer cell surface. Examples of therapeutic atoms and 
molecules include chemotherapeutic compounds. 
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radioisotopes, and toxins, e.g., ricin or diptheria toxin, 
or toxic fragments of such toxins. 

Although the invention has been described with 
reference to the presently preferred embodiments, it 
5 should be understood that various modifications can be 
made without departing from the spirit of the invention. 
Accordingly, the invention is limited only by the 
following claims. 

10 What is claimed is: 
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1, A method of identifying a class II MHC~binding 
fragment of a polypeptide, the method comprising: 

(a) providing a mammalian antigen-presenting cell 
(APC) cell line transfected with a DNA encoding a 

5 polypeptide; 

(b) isolating from the APC a class II MHC molecule 
bound to a peptide, wherein the peptide is a class II MHC- 
binding fragment of the polypeptide; 

(c) elating the peptide from the class II MHC 
10 molecule; and 

(d) identifying the peptide as a fragment of the 
polypeptide. 

2. The method of claim 1, wherein the polypeptide 
has the sequence of a tumor antigen. 

15 3. The method of claim 1, wherein the APC is a 

dendritic cell. 

4. The method of claim 1, wherein the APC is a 
macrophage or a monocyte. 

5. The method of claim 1, wherein the APC is a 
20 B lymphocyte. 

6. The method of claim 1, wherein the mammal is a 
human - 

7- The method of claim 2, wherein the class II MHC 
molecule is selected from the group consisting of a HLA-DR 
25 molecule, a HLA-DQ molecule, and a HLA-DP molecule. 

8, The method of claim 2, wherein the class II MHC 
molecule is a DR molecule with a p-chain encoded by a gene 
selected from the group consisting of DRB1*0401, 
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DRB1*0101, DRB1*0301, DRB1*0701, DRB1*1101, DRB1*1301, 
DRB1*1302, DRB1*1501 and DRB5*0101. 

9. An isolated peptide fewer than 35 amino acid 
residues in length, comprising a sequence KEVLLLVHNLPQH 

5 (SEQ ID N0:1) , 

10. The isolated peptide of claim 9, comprising an 
amino acid sequence selected from the group consisting of: 
KEVLLLVHNLPQHL (SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID 

NO: 3); KEVLLLVHNLPQHL FG (SEQ ID NO: 4); GKEVLLLVHNLPQHL 
10 (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6); and 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13). 

11. An isolated peptide fewer than 35 amino acid 
residues in length, comprising a sequence YLWWVNGQSLPVSPR 
(SEQ ID N0:7) . 

15 12, An altered peptide ligand (APL) ^ the amino acid 

sequence of which is identical, except for 1-6 amino acid 
substitutions, to a fragment of carcinoembryonic antigen 
(CEA) , the fragment being fewer than 35 amino acids 
residues in length and comprising a sequence selected from 

20 the group consisting of KEVLLLVHNLPQH (SEQ ID N0:1), 

YLWWVNGQSLPVSPR (SEQ ID NO: 7), and NPPAQYSWLIDGNIQQH (SEQ 
ID NO: 14), wherein no more than 30% of the amino acid 
residues of the fragment are substituted with different 
amino acid residues in the APL, and wherein the APL binds 

25 to a class II MHC molecule, 

13. The APL of claim 12, wherein the sequence of the 
fragment is selected from the group consisting of 
KEVLLLVHNLPQH (SEQ ID N0:1), YLWWVNGQSLPVSPR (SEQ ID 
NO: 7), and NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 
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14 . A process for making an APL, the process 
comprising: 

(a) carrying out the method of claim 1, and 

(b) synthesizing an APL consisting of a sequence 

5 which is identical to that of the peptide, except having 
amino acid substitutions at 1, 2, 3^ A, 5, or 6 positions 
in the peptide. 

15. The process of claim 14, wherein the polypeptide 
is CEA. 

10 16. A method of activating T cell reactivity in a 

mammal, the method comprising: 

(a) providing (i) a peptide, the sequence of which 
consists of the sequence of a naturally processed fragment 
of CEA, wherein the peptide binds to a class II MHC 

15 molecule of the mammal and elicits a CD4"^ T cell response, 
or (ii) a DNA encoding a polypeptide selected from the 
group consisting of (1) the peptide, (2) the peptide plus 
an amino terminal methionine residue, and (3) either (1) 
or (2) linked to a trafficking sequence; and 

20 (b) administering the peptide or DNA to the mammal, 

17. The method of claim 16, wherein the peptide is 
fewer than 35 amino acid residues in length and comprises 
a sequence KEVLLLVHNLPQH (SEQ ID N0:1). 

18. The method of claim 17, wherein the peptide is 
25 KEVLLLVHNLPQH (SEQ ID N0:1). 

19. The method of claim 17, wherein the peptide is 
KEVLLLVHNLPQHL (SEQ ID N0:2) - 

20. The method of claim 17, wherein the peptide is 
KEVLLLVHNLPQHLF (SEQ ID NO: 3). 
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21. The method of claim 17, wherein the peptide is 
KEVLLLVHNLPQHLFG (SEQ ID N0:4) . 

22 The method of claim 17, wherein the peptide is 
GKBVLLLVHNLPQHL (SEQ ID NO: 5). 

5 23. The method of claim 17, wherein the peptide is 

EGKEVLLLVHNLPQHLFG (SEQ ID NO: 6). 

24. The method of claim 16, wherein the peptide is 
fewer than 35 amino acid residues in length and comprises 
a sequence YLWWVNGQSLPVSPR (SEQ ID NO: 7). 

10 25. The method of claim 24, wherein the peptide is 

YLWWVNGQSLPVSPR (SEQ ID NO: 7). 

26. A method of altering a T cell response in a 
mammal, the method conprising: 

(a) providing (i) an APL having a sequence identical, 
15 except for amino acid substitutions at 1-6 positions, to 

the sequence of a naturally-processed fragment of CEA, 
wherein the APL binds to a class II MHC molecule of the 
mammal, or (ii) a DNA encoding a polypeptide selected from 
the group consisting of (1) the APL, (2) the APL plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence; and 

(b) administering the APL or DNA to the mammal. 

27. The method of claim 1, further comprising: 

(e) providing CD4* lymphocytes from a mammal having a 
25 condition suspected of being associated with presentation 

of the peptide by the class II MHC molecule, wherein the 
APCs of the mammal bear the class II MHC molecule; 

(f ) providing a population of APCs that bear the 
class II MHC molecule with the peptide bound thereto; 
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(g) contacting the population of APCs of (f) with the 
004"^ lymphocytes of (e) ; and 

(h) determining whether the CD4'*' lymphocytes recognize 
the class II MHC-bound peptide, as an indication that 

5 presentation of the peptide to CDA^ T lymphocytes is 
associated with the condition, 

28. The method of claim 27, wherein said 
presentation is associated with a pathological response of 
CD4"^ T lymphocytes. 

10 29. The method of claim 27, wherein said 

presentation is associated with a protective response of 
CD4*^ T lymphocytes. 

30. A method of diagnosis comprising: 

(a) providing a 004"** lymphocyte from an individual 
15 suspected of having or being susceptible to cancer; 

(b) providing an APC which bears on its surface a 
class II MHC molecule of an allele identical to one 
expressed by said individual, wherein the class II MHC 
molecule is bound to a CEA peptide; 

20 (c) contacting the APC with the CD4* lymphocyte; and 

(d) determining whether the CD4* lymphocyte recognizes 

the class II MHC-bound peptide, as an indication that the 

individual has or is susceptible to cancer, 

wherein the peptide comprises an amino acid sequence 
25 selected from the group consisting of: KEVLLLVHNLPQH (SEQ 

ID N0:1); KEVLLLVHNLPQHL (SEQ ID N0:2;) KEVLLLVHNLPQHLF 

(SEQ ID NO: 3); KEVLLLVHNLPQHL FG (SEQ ID NO: 4); 

GKEVLLLVHNLPQHL (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID 

NO: 6); YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGKEVLLLVHNLPQHL 
30 (SEQ ID N0:13); NPPAQYSWLIDGNIQQH (SEQ ID N0:14); and 

NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 
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31- A method of treating a subject suspected of 
having or being susceptible to cancer, the method 
comprising administering the peptide of claim 9 to the 
subject. 

5 32. A method of identifying a reagent for diagnosing 

cancer, the method comprising: 

(a) providing a test reagent selected from the group 
consisting of a Fab fragment, a monoclonal antibody (mAb) , 
and a single chain Fv (scFv) fragment; 
10 (b) providing a complex comprising a class II MHC 

molecule bound to a peptide comprising a sequence selected 
from the group consisting of KEVLLLVHNLPQH (SEQ ID N0:1), 
KEVLLLVHNLPQHL (SEQ ID NO: 2); KEVLLLVHNLPQHLF (SEQ ID 
NO: 3); KEVLLLVHNLPQHLFG (SEQ ID NO: 4); GKEVLLLVHNLPQHL 
15 (SEQ ID NO: 5); EGKEVLLLVHNLPQHLFG (SEQ ID N0:6); 

YLWWVNGQSLPVSPR (SEQ ID NO: 7); EGKEVLLLVHNLPQHL (SEQ ID 
N0:13); NPPAQYSWLIDGNIQQH (SEQ ID N0:14); and 
NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15); and 

(c) testing whether the test reagent binds to the 
20 complex, wherein binding to the complex is an indication 
that the test reagent is potentially useful for diagnosing 
cancer. 

33. The method of claim 30, wherein the class II MHC 
molecule is a DR molecule with a p-chain encoded by a 

25 DRB1*0401 gene. 

34. The method of claim 30, wherein the class II MHC 
molecule is a DR molecule with a p--chain encoded by a 
DRB1*0101 gene- 

35. A method of diagnosis, the method comprising: 
^0 providing a test cell from a mammalian subject; 
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(b) providing a reagent that binds to a CEA peptide 
fragment bound to a class II MHC molecule; 

(c) contacting the test cell with the reagent; and 

(d) detecting binding of the reagent to the test cell 
5 as an indication that the test cell is a cancer cell. 

36. A method of cancer therapy comprising: 

(a) providing a composition comprising a reagent 
selected from the group consisting of a Fab fragment, a 
mAb, and a scFv fragment, wherein the reagent recognizes a 

10 naturally processed CEA peptide bound to a MHC class II 
molecule, the reagent being linked to an agent selected 
from the group consisting of a chemotherapeutic compound, 
a radioactive isotope and a toxin; and 

(b) administering the composition to a subject 
15 suspected of having or being susceptible to a cancer 

characterized by expression of CEA. 

37. A method of identifying a class II MHC-binding 
fragment of a tumor antigen, the method comprising: 

(a) providing a mammalian APC comprising a class II 
20 MHC molecule and the tumor antigen; 

(b) isolating from the APC the class II MHC molecule 
bound to a peptide, wherein the peptide is a class II MHC 
binding fragment of the tumor antigen; 

(c) eluting the peptide from the class II MHC 
25 molecule; and 

(d) identifying the amino acid sequence of the 
peptide . 

38. The method of claim 37, further comprising: 

(e) providing CD4* lymphocytes from a mammal having a 
30 cancer suspected of being associated with presentation of 

the peptide by the class II MHC molecule, wherein the APCs 
of the mammal bear the class II MHC molecule; 
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(f) providing a population of APCs that bear the 
class II MHC molecule with the peptide bound thereto; 

(g) contacting the population of APCs of (f) with the 
CD4'*' lymphocytes of (e) ; and 

(h) determining whether the CD4"' lymphocytes recognize 
the class II MHC bound peptide, as an indication that 
presentation of the peptide to CD4^ lymphocytes is 
associated with the cancer. 

39. An isolated DNA comprising a nucleotide sequence 
encoding a peptide fewer than 35 amino acids in length and 
comprising the sequence of SEQ ID N0:1, 7 or 14. 

40. A vector comprising the DNA of claim 39. 

41. The vector of claim 40, wherein the nucleotide 
sequence is operatively linked to a transcriptional 

15 regulatory element. 

42. A cell comprising the vector of claim 40. 

43. A cell comprising the vector of claim 41. 

44. An isolated peptide fewer than 35 amino acid 
residues in length, comprising a sequence 

20 NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 

45. The isolated peptide of claim 44, comprising a 
sequence NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 

46. The method of claim 17, wherein the peptide is 
EGKEVLLLVHNLPQHL (SEQ ID NO: 13), 



25 



47. The method of claim 16, wherein the peptide is 
fewer than 35 amino acid residues in length and comprises 
a sequence NPPAQYSWLIDGNIQQH (SEQ ID NO: 14). 
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48- The method of claim 47, wherein the peptide is 
NPPAQYSWLIDGNIQQH (SEQ ID N0:14). 

49. The method of claim 47, wherein the peptide is 
NPPAQYSWLIDGNIQQHT (SEQ ID NO: 15). 

5 50. The method of claim 2, wherein the tiimor antigen 

is CEA. 

51. The method of claim 2, wherein the tumor antigen 
is selected from the group consisting of prostate specific 
antigen, MAGE 1-4, MAGE 6, MAGE 12, a mucin, tyrosinase, 
10 MART, Pmel 17, N-acetylglucoaminyltransf erase V intron V 
sequence. Prostate Ca psm, PRAME, p-catenin, MUM-l-B, 
GAGE-1, BAGE 2-10, C-ERB2, Epstein-Barr Virus nuclear 
antigen 1-6, gp75, human papilloma virus E6 and E7, p53, 
lung resistance protein, Bcl-2, and Ki-67. 

15 52. The method of claim 16, wherein the mammal is 

suspected of having or being susceptible to cancer. 

53. An isolated peptide comprising a fragment of 
CEA comprising KEVLLLVHNLPQH (SEQ ID N0:1) and a further 
1-15 residues of CEA sequence at the C-terminus of SEQ ID 
20 N0:1, at the N-terminus of SEQ ID N0:1, or at both the N- 
terminus and the C-terminus of SEQ ID N0:1. 

54- An isolated peptide comprising a fragment of 
CEA comprising YLWWVNGQSLPVSPR (SEQ ID NO: 7) and a further 
1-15 residues of CEA sequence at the C-terminus of SEQ ID 
25 NO: 7, at the N-terminus of SEQ ID NO: 7, or at both the N- 
terminus and the C-terminus of SEQ ID NO: 7. 
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55. An isolated peptide comprising a fragment of 
CEA comprising NPPAQYSWLIDGNIQQH (SEQ ID NO: 14) and a 
further 1-15 residues of CEA sequence at the C- 

5 terminus of SEQ ID NO: 14, at the N--terTtiinus of SEQ ID 

NO: 14, or at both the N-terminus and the C- terminus 
of SEQ ID NO: 14, 

56. A method of activating T cell reactivity in a 
mammal, the method comprising: 

10 (a) providing (i)the peptide of claim 53, or (ii) a 

DNA encoding a polypeptide selected from the group 
consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence; and 

15 (b) administering the peptide or DNA to the mammal. 

57. A method of activating T cell reactivity in a 
mammal, the method comprising: 

(a) providing {i)the peptide of claim 54, or (ii) a 
DNA encoding a polypeptide selected from the group 

20 consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence; and 

(b) administering the peptide or DNA to the mammal. 

58. A method of activating T cell reactivity in a 
25 mammal, the method comprising: 

(a) providing (i)the peptide of claim 55, or (ii) a 
DNA encoding a polypeptide selected from the group 
consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
30 (2) linked to a trafficking sequence; and 
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(b) administering the peptide or DNA to the mammal. 

59. An isolated fragment of CEA shorter than full- 
length CEA, wherein the fragment comprises one or more 
amino acid secjuences selected from the group consisting of 
5 SEQ ID N0:1, SEQ ID NO: 7, and SEQ ID NO: 14. 

60- A method of activating T cell responsiveness 
in a mammal, the method comprising: 

(a) providing (i) the fragment of claim 59, or (ii) a 
DNA encoding a polypeptide selected from the group 

10 consisting of (1) the fragment, (2) the fragment plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence, wherein the DNA does 
not encode full-length CEA; and 

(b) administering the fragment or DNA to the mammal. 

61. A method of activating T cell responsiveness 
in a mammal, the method comprising: 

(a) administering to the mammal (i) CEA or (ii) a DNA 
encoding CEA; and 

(b) testing CD4+ T cells of the animal for 
responsiveness to a naturally processed fragment of CEA. 



15 



20 



62. An isolated peptide comprising: 

(a) at least one CEA segment selected from the group 
consisting of SEQ ID N0:1, SEQ ID NO: 7, and SEQ ID N0:14; 

25 and 

(b) one or more amino acids at the C-terminus of the 
CEA segment, at the N-terminus of the CEA segment, or at 
both the C-terminus and the N-terminus of the CEA segment, 

wherein the peptide has an amino acid sequence that 
30 is not identical to that of full-length CEA. 
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63. A method of activating T cell reactivity in a 
mammal, the method comprising: 

(a) providing (i) the peptide of claim 62, or (ii) a 
DNA encoding a polypeptide selected from the group 

5 consisting of (1) the peptide, (2) the peptide plus an 
amino terminal methionine residue, and (3) either (1) or 
(2) linked to a trafficking sequence, wherein the DNA does 
not encode full-length CEA; and 

(b) administering the peptide or DNA to the mammal. 
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ATGGAGTCTCCCTCGGCCCCTCCCCACAGATGGTGCATCCCCTGGCAGAGGCTCCTGCT 

CACAGCCTCACTTCTAACCTTCTGGAACCCGCCCACCACTGCCAAGCTCACTATTGAAT 

CCACGCCGTTCAATGTCGCAGAGGGGAAGGAGGTGCTTCTACTTGTCCACAATCTGCCC 

CAGCATCTTTTTGGCTACAGCTGGTACAAAGGTGTIAAGAGTGGATGGCAACCGTCAAAT 

TATAGGATATGTAATAGGAACTCAACAAGCTACCCCAGGGCCCGCATACAGTGGTCGAG 

AGATAATATACCCCAATGCATCCCTGCTGATCCAGAACATCATCCAGAATGACACAGGA 

TTCTACACCCTACACGTCATAAAGTCAGATCTTGTGAATGAAGAAGCAACTGGCCAGTT 

CCGGGTATACCCGGAGCTGCCCAAGCCCTCCATCTCCAGCAACAACTCCAAACCCGTGG 

AGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGACTCAGGACGCAACCTACCTG 

TGGTGGGTi^AACAATCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGGCAA 

CAGGACCCTCACTCTATTCAATGTCACMGAAATGACACAGCAAGCTACAAATCTGAAA 

CCCAGAACCCAGTGAGTGCCAGGCGCAGTGATTCAGTCATCCTGAATGTCCTCTATGGC 

CCGGATGCCCCCACCATTTCCCCTCTAAACACATCTTACAGATCAGGGGAAAATCTGAA 

CCTCTCCTGCCATGCAGCCTCTAACCCACCTGCACAGTACTCTTGGTTTGTCAATGGGA 

CTTTCCAGCAATCCACCCAAGAGCTCTTTATCCCCAACATCACTGTGAATAATAGTGGA 

TCCTATACGTGCCAAGCCCATAACTCAGACACTGGCCTCAATAGGACCACAGTCACGAC 

GATCACAGTCTATGCAGAGCCACCCAAACCCTTCATCACCAGCAACAACTCCAACCCCG 

TGGAGGATGAGGATGCTGTAGCCTTAACCTGTGAACCTGAGATTCAGAACACAACCTAC 

CTGTGGTGGGTAAATT^TCAGAGCCTCCCGGTCAGTCCCAGGCTGCAGCTGTCCAATGA 

CAACAGGACCCTCACTCTACTCAGTGTCACAAGGAATGATGTAGGACCCTATGAGTGTG 

GAATCCAGAACGAATTAAGTGTTGACCACAGCGACCCAGTCATCCTGAATGTCCTCTAT 

GGCCCAGACGACCCCACCATTTCCCCCTCATACACCTATTACCGTCCAGGGGTGAACCT 

CAGCCTCTCCTGCCATGCAGCCTCTAACCCACCTGCACAGTATTCTTGGCTGATTGATG 

GGT^CATCCAGCAACACACACAAGAGCTCTTTATCTCCAACATCACTGAGAA-GAACAGC 

GGACTCTATACCTGCCAGGCCAATAACTCAGCCAGTGGCCACAGCAGGACTACAGTCAA 

GACAATCACAGTCTCTGCGGAGCTGCCCAAGCCCTCCATCTCCAGCTIACAACTCCAJUIC 

CCGTGGAGGACAAGGATGCTGTGGCCTTCACCTGTGAACCTGAGGCTCAGAACACAACC 

TACCTGTGGTGGGTAAATGGTCAGAGCCTCCCAGTCAGTCCCAGGCTGCAGCTGTCCAA 

TGGCAACAGGACCCTCACTCTATTCAATGTCACAAGAT^TGACGCAAGAGCCTATGTAT 

GTGGAATCCAGAACTCAGTGAGTGCAAACCGCAGTGACCCAGTCACCCTGGATGTCCTC 

TATGGGCCGGACACCCCCATCATTTCCCCCCCAGACTCGTCTTACCTTTCGGGAGCGAA 

CCTCAACCTCTCCTGCCACTCGGCCTCTAACCCATCCCCGCAGTATTCTTGGCGTATCA 

ATGGGATACCGCAGCAACACACACAAGTTCTCTTTATCGCCAAAATCACGCCAAATAAT 

AACGGGACCTATGCCTGTTTTGTCTCTAACTTGGCTACTGGCCGCAATAATTCCATAGT 

CAAGAGCATCACAGTCTCTGCATCTGGAACTTCTCCTGGTCTCTCAGCTGGGGCCACTG 

TCGGCATCATGATTGGAGTGCTGGTTGGGGTTGCTCTGATATAG 
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Fie. IB 

^DESPSAPPHRWCIPWQRLLLTASLLTFWNPPTTAKLTIESTPF^^VAEGKEVLLLV 

HNLPQHLFGYSWYKGERVDGNRQDGYVIGTQQATPGPAYSGREnYPNASLLIQN 

nQNDTGFYTLHVIKSDLVNEEATGQFRVYPELPKPSISSNNSKPVEDKDAVAFTC 

EPETQDATYLWWVNNQSLPVSPRLQI^G>niTLTLFNVTRNDTASYKCErQNPV 

SARRSDSmNVLYGPDAPTISPLNTSYRSGENLNLSCHAASNPPAQYSWFVNGT 

FQQSTQELFIPNITVNNSGSYTCQAHNSDTGLNRTTVTTITVYAEPPKPFrreh^^ 

PVEDEDAVALTCEPmQ>riTYLWWVNNQSLPVSPMX5LSNDNRTLTLLSVT^^ 

VGPYECGIQNELSVDHSDPVILNVLYGPDDPTISPSYTYYRPGVNLSLSCHAASNP 

PAQYSWLroGNIQQHTQELFISNITEKNSGLYTCQANNSASGHSRTrVKTITVSAE 

LP]a>SISS^INSKPVEDKDAVAFTCEPEAQ^^ITyL\^WVNGQSLPVSPRLQLSNGN 

RTLTUWTRNDARAYVCGIQNSVSANRSDPVTLDVLYGPDTPnSPPDSSYLSGA 

JCNI^SCHSASNPSPQYSWRINGIPQQHTQVLFIAKITPNNNGTYACFVSNLATGR 

NNSIVKSrrVSASGTSPGLSAGATVGIMIGVLVGVALI 
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SEQUENCE LISTING 

<110> Zycos Inc. 

<120> PEPTIDE EPITOPES BLEC0GNI2ED BY ANTIGEN 
SPECIFIC CD4+ T LYMPHOCYTES 

<130> 08191-019WO1 

<140> PCT/USOl/28467 
<141> 2001-09-12 

<150> 60/232,185 
<151> 2000-09-12 

<160> 16 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 13 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His 
1 5 10 

<210> 2 

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu 
15. 10 

<210> 3 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu Phe 
15 10 15 

<210> 4 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu Phe Gly 
15 10 15 

<210> 5 

<211> 15 

<212> PRT 

<213> Homo sapiens 
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2 

<400> 5 

Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu 
15 10 15 

<210> 6 
<211> 18 
<212> PRT 

<213> Homo sapiens 

<400> 6 

Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu 

15 10 15 

Phe Gly 



<210> 7 
<211> 15 
<212> PRT 

<213> Homo sapiens 

<400> 7 

Tyr Leu Trp Trp Val Asn Gly Gin Ser Leu Pro Val Ser Pro Arg 
15 10 15 

<210> 8 
<211> 25 
<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Ala He Ser Gly Val Pro Val Leu Gly Phe Phe lie He Ala Val 

15 10 15 

Leu Met Ser Ala Gin Glu Ser Trp Ala 
20 25 

<210> 9 
<211> 5 
<212> PRT 

<213> Homo sapiens 

<400> 9 

Lys Phe Glu Arg Gin 
1 5 

<210> 10 
<211> 15 
<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Asp Asp Gin Arg Asp Leu He Ser Asn Asn Glu Gin Leu Pro 
15 10 15 

<210> 11 

<211> 4 

<212> PRT 

<213> Homo sapiens 

<400> 11 
Lys Asp Glu Leu 
1 
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<210> 12 

<211> 2109 

<212> DNA 

<213> Homo sapiens 

<400> 12 

atggagtctc cctcggcccc tccccacaga tggtgcatcc cctggcagag gctcctgctc 60 

acagcctcac ttctaacctt ctggaacccg cccaccactg ccaagctcac tattgaatcc 120 

acgccgttca atgtcgcaga ggggaaggag gtgcttctac ttgtccacaa tctgccccag 180 

catctttttg gctacagctg gtacaaaggt gaaagagtgg atggcaaccg tcaaattata 240 

ggatatgtaa taggaactca acaagctacc ccagggcccg catacagtgg tcgagagata 300 

atatacccca atgcatccct gctgatccag aacatcatcc agaatgacac aggattctac 360 

accctacacg tcataaagtc agatcttgtg aatgaagaag caactggcca gttccgggta 420 

tacccggagc tgcccaagcc ctccatctcc agcaacaact ccaaacccgt ggaggacaag 480 

gatgctgtgg ccttcacctg tgaacctgag actcaggacg caacctacct gtggtgggta 540 

aacaatcaga gcctcccggt cagtcccagg ctgcagctgt ccaatggcaa caggaccctc 600 

actctattca atgtcacaag aaatgacaca gcaagctaca aatgtgaaac ccagaaccca 660 

gtgagtgcca ggcgcagtga ttcagtcatc ctgaatgtcc tctatggccc ggatgccccc 720 

accatttccc ctctaaacac atcttacaga tcaggggaaa atctgaacct ctcctgccac • 780 

gcagcctcta acccacctgc acagtactct tggtttgtca atgggacttt ccagcaatcc 840 

acccaagagc tctttatccc caacatcact gtgaataata gtggatccta tacgtgccaa 900 

gcccataact cagacactgg cctcaatagg accacagtca cgacgatcac agtctatgca 960 

gagccaccca aacccttcat caeca gcaac aactccaacc ccgtggagga tgaggatgct 1020 

gtagccttaa cctgtgaacc tgagattcag aacacaacct acctgtggtg ggtaaataat 1080 

cagagcctcc cggtcagtcc caggctgcag ctgtccaatg acaacaggac cctcactcta 1140 

ctcagtgtca caaggaatga tgtaggaccc tatgagtgtg gaatccagaa cgaattaagt 1200 

gttgaccaca gcgacccagt catcctgaat gtcctctatg gcccagacga ccccaccatt 1260 

tccccctcat acacctatta ccgtccaggg gtgaacctca gcctctcctg ccatgcagcc 1320 

tctaacccac ctgcacagta ttcttggctg attgatggga acatccagca acacacacaa 1380 

gagctcttta tctccaacat cactgagaag aacagcggac tctatacctg ccaggccaat 1440 

aactcagcca gtggccacag caggactaca gtcaagacaa tcacagtctc tgcggagctg 1500 

cccaagccct ccatctccag caacaactcc aaacccgtgg aggacaagga tgctgtggcc 1560 

ttcacctgtg aacctgaggc tcagaacaca acctacctgt ggtgggtaaa tggtcagagc 1620 

ctcccagtca gtcccaggct gcagctgtcc aatggcaaca ggaccctcac tctattcaat 1680 

gtcacaagaa atgacgcaag agcctatgta tgtggaatcc agaactcagt gagtgcaaac 1740 

cgcagtgacc cagtcaccct ggatgtcctc tatgggccgg acacccccat catttccccc 1800 

ccagactcgt cttacctttc gggagcgaac ctcaacctct cctgccactc ggcctctaac 1860 

ccatccccgc agtattcttg gcgtatcaat gggataccgc agcaacacac acaagttctc 1920 

tttatcgcca aaatcacgcc aaataataac gggacctatg cctgttttgt ctctaacttg 1980 

gctactggcG gcaataattc catagtcaag agcatcacag tctctgcatc tggaacttct 2040 

cctggtctct cagctggggc cactgtcggc atcatgattg gagtgctggt tggggttgct 2100 

ctgatatag 2109 

<210> 13 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 13 

Glu Gly Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gin His Leu 
15 10 15 

<210> 14 
<211> 17 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Asn Pro Pro Ala Gin Tyr Ser Trp Leu lie Asp Gly Asn lie Gin Gin 
15 10 15 
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His 



<210> 15 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 15 

Asn Pro Pro Ala Gin Tyr Ser Trp Leu He Asp Gly Asn He Gin Gin 

1 5 * 10 15 

His Thr 



<210> 16 
<211> 702 
<212> PRT 

<213> Homo sapiens 



<400> 16 



Met Glu 


Ser 


Pro 


Ser Ala Pro Pro 


1 






5 


Arg Leu 


Leu 


Leu 


Thr Ala Ser Leu 






20 




Thr Ala 


Lys 


Leu 


Thr He Glu Ser 




35 




40 


Lys Glu 


Val 


Leu 


Leu Leu Val His 


50 






55 


Tyr Ser 


Trp 


Tyr 


Lys Gly Glu Arg 


65 






70 


Gly Tyr 


Val 


He 


Gly Thr Gin Gin 








85 


Gly Arg 


Glu 


He 


He Tyr Pro Asn 






100 




He Gin 


Asn 


Asp 


Thr Gly Phe Tyr 




115 




120 


Leu Val 


Asn 


Glu 


Glu Ala Thr Gly 


130 






135 


Pro Lys 


Pro 


Ser 


He Ser Ser Asn 


145 






150 


Asp Ala 


Val 


Ala 


Phe Thr Cys Glu 








165 


Leu Trp 


Trp 


Val 


Asn Asn Gin Ser 






180 




Leu Ser 


Asn 


Gly 


Asn Arg Thr Leu 




195 




200 


Asp Thr 


Ala 


Ser 


Tyr Lys Cys Glu 


210 






215 


Arg Ser 


Asp 


Ser 


Val He Leu Asn 


225 






230 


Thr He 


Ser 


Pro 


Leu Asn Thr Ser 








245 


Leu Ser 


Cys 


His 


Ala Ala Ser Asn 






260 




Val Asn 


Gly 


Thr 


Phe Gin Gin Ser 




275 




280 


He Thr 


Val 


Asn 


Asn Ser Gly Ser 


290 






295 


Asp Thr 


Gly 


Leu 


Asn Arg Thr Thr 


305 






310 


Glu Pro 


Pro 


Lys 


Pro Phe He Thr 



His 


Arg Trp 


Cys He Pro Trp Gin 




10 




15 


Leu 


Thr 


Phe 


Trp Asn Pro Pro Thr 


25 






30 


Thr 


Pro 


Phe 


Asn Val Ala Glu Gly 








45 


Asn 


Leu 


Pro 


Gin His Leu Phe Gly 








60 


Val 


Asp Gly 


Asn Arg Gin He He 






75 


80 


Ala 


Thr 


Pro 


Gly Pro Ala Tyr Ser 




90 




95 


Ala 


Ser 


Leu 


Leu He Gin Asn He 


105 






110 


Thr 


Leu 


His 


Val He Lys Ser Asp 








125 


Gin 


Phe Arg 


Val Tyr Pro Glu Leu 








140 


Asn 


Ser Lys 


Pro Val Glu Asp Lys 






155 


160 


Pro 


Glu 


Thr 


Gin Asp Ala Thr Tyr 




170 




175 


Leu 


Pro 


Val 


Ser Pro Arg Leu Gin 


185 






190 


Thr 


Leu 


Phe 


Asn Val Thr Arg Asn 








205 


Thr 


Gin 


Asn 


Pro Val Ser Ala Arg 








220 


Val 


Leu 


Tyr 


Gly Pro Asp Ala Pro 






235 


240 


Tyr 


Arg 


Ser 


Gly Glu Asn Leu Asn 




250 




255 


Pro 


Pro 


Ala 


Gin Tyr Ser Trp Phe 


265 






270 


Thr 


Gin 


Glu 


Leu Phe He Pro Asn 








285 


Tyr 


Thr Cys 


Gin Ala His Asn Ser 






300 


Val 


Thr 


Thr 


He Thr Val Tyr Ala 






315 


320 


Ser 


Asn 


Asn 


Ser Asn Pro Val Glu 
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325 330 335 

Asp Glu Asp Ala Val Ala Leu Thr Cys Glu Pro Glu lie Gin Asn Thr 

340 345 350 

Thr Tyr Leu Trp Trp Val Asn Asn Gin Ser Leu Pro Val Ser Pro Arg 

355 360 365 

Leu Gin Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu Ser Val Thr 

370 375 380 

Arg Asn Asp Val Gly Pro Tyr Glu Cys Gly He Gin Asn Glu Leu Ser 
385 390 395 400 

Val Asp His Ser Asp Pro Val He Leu Asn Val Leu Tyr Gly Pro Asp 

405 410 415 

Asp Pro Thr He Ser Pro Ser Tyr Thr Tyr Tyr Arg Pro Gly Val Asn 

420 425 430 

Leu Ser Leu Ser Cys His Ala Ala Ser Asn Pro Pro Ala Gin Tyr Ser 

435 440 445 

Trp Leu He Asp Gly Asn He Gin Gin His Thr Gin Glu Leu Phe He 

450 455 460 

Ser Asn He Thr Glu Lys Asn Ser Gly Leu Tyr Thr Cys Gin Ala Asn 
465 470 475 480 

Asn Ser Ala Ser Gly His Ser Arg Thr Thr Val Lys Thr He Thr Val 

485 490 495 

Ser Ala Glu Leu Pro Lys Pro Ser He Ser Ser Asn Asn Ser Lys Pro 

500 505 510 

Val Glu Asp Lys Asp Ala Val Ala Phe Thr Cys Glu Pro Glu Ala Gin 

515 520 525 

Asn Thr Thr Tyr Leu Trp Trp Val Asn Gly Gin Ser Leu Pro Val Ser 

530 535 540 

Pro Arg Leu Gin Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu Phe Asn 
545 550 555 560 

Val Thr Arg Asn Asp Ala Arg Ala Tyr Val Cys Gly He Gin Asn Ser 

565 570 575 

Val Ser Ala Asn Arg Ser Asp Pro Val Thr Leu Asp Val Leu Tyr Gly 

580 585 590 

Pro Asp Thr Pro He He Ser Pro Pro Asp Ser Ser Tyr Leu Ser Gly 

595 600 605 

Ala Asn Leu Asn Leu Ser Cys His Ser Ala Ser Asn Pro Ser Pro Gin 

610 615 620 

Tyr Ser Trp Arg He Asn Gly He Pro Gin Gin His Thr Gin Val Leu 
625 630 635 640 

Phe He Ala Lys He Thr Pro Asn Asn Asn Gly Thr Tyr Ala Cys Phe 

645 650 655 

Val Ser Asn Leu Ala Thr Gly Arg Asn Asn Ser He Val Lys Ser He 

660 665 670 

Thr Val Ser Ala Ser Gly Thr Ser Pro Gly Leu Ser Ala Gly Ala Thr 

675 680 685 

Val Gly He Met He Gly Val Leu Val Gly Val Ala Leu He 
690 695 700 
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